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4.  Assignment  by  a decision  tree  procedure. 
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DT  methods  compared  to  a distance  function  criterion. 
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(c)  weighted  dimensional  distances. 

(d)  Zero-one  count. 

6.  Comparison  of  the  techniques  in  terms  of  severity  of  .nis- 
ei as  si  fi cation,  defined  os  costs  associated  with  inappropriate 
treatment  recommendations. 

7.  Comparison  of  the  techniques  in  terms  of  information  required,  by 
(a)  eliminating  indexes,  (b)  coHupsIng  Indexes,  and  (c)  sampling 
of  respondents. 

8.  Comparison  of  the  techniques  In  terms  of  ease  of  calculation. 
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INTRODUCTION 


Mo  exact  date  can  be  designated  as  m rking  the  birth  of  organizational 
development.  Perhaps  the  late  1950's  or  early  1960's  marked  the  first  use 
of  the  specific  term.  What  has  taken  place,  therefore,  has  occurred 
within  the  last  15  yean,  years  which  have  seen  a substantial  Investment 
In  the  range  of  activities  loosely  representing  this  applied  field. 

Although  we  lack  exact  dollar  counts,  a plausible  estimate  of  the  total 
funds  Invested  in  organizational  development  must  run  to  hundreds  of 
millions  of  dollars.  By  any  standard,  this  Is  a large  amount,  one  that 
no  entity  — whether  It  be  public  or  private  — may  take  lightly. 

This  same  time  pirlod  represents  as  well  the  first  point  at  which 
It  was  conceptualized  as  organ! zatlonal  development,  as  opposed  to  manage- 
ment development  or  simply  training.  No  exact  definition  has  general 
currency,  but  the  term  Is  generally  taken  to  refer  collectively  to  an 
assortment  of  training  or  therapeutic  interventions  whose  aim  Is  presumed 
to  be  Improvement  of  the  organization  <snd  Its  members. 

Tha  reasons  for  organizational  development's  emergence  at  this 
particular  time  are  not  totally  clear.  Still,  It  no  doubt  ties  to  the  series 
of  traumatic  national  events  which  characterized  the  turbulent  1960's  and 
early  1970's--war,  assassination,  inflation,  energy  crises,  and  political 
scandal.  It  was,  in  these  short  years,  an  Intense  period  that  called  into 
question  old  ways  of  solving  problems  and  old  standards  of  behavior,  while 
It  called  for  new  ones. 

This  applied  profession's  success  In  producing  such  constructive 
changes  Is  another  matter,  however.  Whatever  its  accomplishments  rnay  have 
been,  they  have  been  poorly  documented,  wl  th  the  result  that  the  base  of 
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scdeotiflc  knowledge  in  the  field  has  been  remerably  thin.  Kahn  (1974) 
expressed  it.  nicely: 

"A  few  theoretical  propositions  are 
repeated  without  additional  data  or 
detelopnaftt  i a few  bl  la  of  honey 
atevlcf  a m rei  terated  wi  thaut  proof 
er  dlfpree#.  and  a few  stun# 
empirical  general  1 zeti  ons  are  quoted 
with  rewereaot  hut  without  refinement 
or  explication."  (p.  487) 

Me  peecpeded  to  cite  a pointed  example  of  the  redundaic  if  the  0*3. 
lltarotyee:  of  the  200  1 tana  contained  in  a prominent  bibliography „ only 
ene- fourth  iwlodwi  original  quantitative  data.  One  of  the  present 
author*  (Cower®,  19*71  *>  sited  similar  statistics.:  of  the  many  titles 
listed  for  this  field  In  tee  abstract*  for  these  15  years,,  only  18.  studies 
other  teas  doctoral  dissertations  car  be  found  which  contain  real  evidence* 
and  * telfd  of  these  are  tee  work  of  three  persons) 

The  sane  article  catalogued  a list  of  other  shortcomings  which 
orgeetaatioeal  dowel opewt  must  be  conciliated:  to  have: 

• >|wpjrftcf  ftll  & " fn^deewrtse  realism,.  Inadequate 

relevance.,  and  InadtopMtei  time  available  In 
tee  interventions  undertaken.. 

. tyteMftiaMyn  - uueradwouacy for  fanciy  fmm ,. 
ante  more  prwwfeicn  tears  cmreftil  design . 

• Hfe&feflP  .maUM&imL.  ’ S ngle  • 

weetete  ante  iwispl  acate  afrfarts  stewsing,  free 
fatete  In  «n  lk  catalytic"  ataawaa  which  tium 
eeh  to-  b»  QBntenfir  leee... 
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It  Is  particularly  to  the  last  of  these  Issues  --  the  consultant's 


role  --  that  the  present  report  turns.  An  Important  part  of  the  consultant's 
role  Is  often  presumed  to  be  diagnosis  — translating  a wide  variety  of 
symptoms  Into  a coherent  pattern  that  permits  planning  and  carrying  out 
appropriate  remedial  action.  According  to  Lawrence  & Lorsch  (1969),  the 
reasons  for  the  importance  of  diagnosis  In  organisational  development  are 
many  and  persuasive: 

(1)  The  client  system  may  not  be  aware  of  the  problem  at  all. 

For  example,  the  difference  between  present  effectiveness 
levels  and  unanticipated  opportunities,  rather  than  obvious 
difficulties,  may  be  the  "problem." 

(2)  The  client  system  may  not  be  aware  of  the  real  problem. 

(3)  A discrepancy  between  actual  and  desired  outcomes  does  not 
explain  and  account  for  Itself. 

(4)  Problem  variance  Is  likely  to  be  multiply  caused. 

(5)  Causes  are  likely  to  interrelate  in  complex  ways. 

(6)  Causes  are  likely  to  differ  greatly  In  potency,  and  what 
Is  desired  Is  a designation  of  variables  with  leverage. 

(7)  Meaning  can  only  be  given  to  causal  Information  by  casting 
It  Into  an  appropriate  configuration  against  a set  of 
principles. 

(B)  What  is  required  for  action  planning  is  an  oyerall  and 
Integrated  view,  not  a parochial  one. 

(9)  Diagnosis,  if  done  well,  provides  some  Insurance  agalns 
rushing  Into  an  inappropriate  treatment  that  may  prove 
damaging. 
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In  contrast  to  this,  the  article  cited  earlier  (Bowers,  197S)  turned 
attention  to  assumptions  concerning  the  consultant's  diagnostic  role  in 
O.D,  The  points  made  there  hear  repeating. 


. While  a number  of  wri  ters  have  attributed  a diagnostic 
role  to  consultants,  what  goes  unrecognized  is  that 
their  diagnoses  are  often  put  to  little  other  than 
heuristic  use  (that  Is,  they  are  used  merely  to 
stimulate  an  interesting  discussion). 

. An  unpublished  study  of  consultants'  diagnostic  skills 
showed  (a)  inability  to  agree  with  diagnostic  con- 
clusions more  formally  obtained,  and  (h)  more  positive 
change  occurring  where  consultants  did  relatively  little 
diagnosing  than  where  they  did  a great  deal  of  it. 

. Most  consultants  currently  employ  diagnostic  methods 
which  rely  upon  one  observer— the  consultant  himself 
or  herself— to  obtain  data.  The  N is  restricted,  not 
only  in  this  fashion,  but  also  by  the  fact  that  tills 
consultant-observer  la  limited  to  a time-bound  behavior 
sample. 

These  observations  should  not  surprise  us.  Findings  from  the  general 
field  of  assessment  and  classification  have  provided  strong  support  to 
the  position  that  stfttlstl cal  prediction  is  superior  to  non-statlstlcal 
or  Judgmental  methods  (Cronbach,  1960).  For  example.  In  Meehl's  (1954) 
major  review  of  clinical  versus  statistical  prediction,  it  was  found  that 
statistical  prediction  was  equal  to  or  superior  to  clinical  prediction 
in  19  out  of  20  cases. 
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Citing  this  body  of  accumulated  evidence,  Cronbach  explores  the 
reasons  for  perenially  poor  showings  by  (clinical)  judges: 

. Judges  combine  data  by  means  of  intuitive  weightings 
which  they  have  not  checked. 

. Judges  casually  change  weights  from  one  case  to  the 
next. 

. Judges  are  unreliable,  in  the  sense  that  the  same 
case  might  not  be  judged  the  same  way  twice  in 
successl on. 

. Judges  have  stereotypes  and  prejudices  which  affect 
their  judgments. 

His  conclusions  are  the  following: 

"What  does  this  imply?  It  implies  that,  counselors, 
personnel  managers,  and  clinical  psychologists  should 
use  formal  statistical  procedures  wherever  possible 
to  find  the  best  combining  formula  and  the  true 
expectancies  for  their  own  situation.  They  should 
then  be  extremely  cautious  in  departing  from  the 
recon.nendations  arrived  at  on  the  basis  of  the 
statistics..."  (p.  348) 

If  this  is  the  desirable  state  for  organizational  development  as  well, 
It  is  scarcely  what  in  fact  obtains.  Levinson  (1972,  1973),  in  his 
published  remarks  which  led  to  the  celebrated  exchange  with  Burke  (1973) 
and  Sashkin  (1973),  stated  that  there  Is  little  resembling  formal  diagnosis 
in  0.1),  Consistent  with  Kahn's  observations  cited  above,  Levinson  stated 
that  the  field  is  characterized  by  "ad  hoc  problem-solving  efforts  and  a 
heavy  emphasis  on  expedient  techniques."  Hchy  (1975)  does  not  reassure 
us  when  he  finds,  in  his  systematic  empirical  study,  that  change  agents 
(consultants)  seem  to  have  limited  diagnostic  perspectives,  that  their 


6 


n 


diagnostic  frameworks  are  rather  closely  limited  to  their  personal  values 
anc!  goals,  and  that  the  potential  for  intrusion  of  bias  is  not  small. 

Unfortunately,  recommended  alternatives  are  relatively  scarce. 
Levinson's  recommendati ons  build  upon  a view  and  a method  of  organizational 
diagnosis  that  i^  an  extension  of  the  clinical  case  method.  While  large 
amounts  of  empirical  data  would  be  gathered,  injecting  a clinical  judge 
between  the  data  and  the  conclusion  runs  the  risks  listed  above  by 
Cronbach . 

On  the  other  hand,  this  is  not  the  situation  nor  the  age  for  "raw" 
empiricism.  As  the  lengthy  discussions  nationally  about  discrimination 
in  testing  have  revealed,  in  the  interest  of  fairness  and  equal  treatment, 
more  must  be  taken  into  account  in  a decision  process  than  any  simple  set 
of  numbers,  especially  where  connections  between  the  numbers  and  real 
world  events  m«y  not  be  obvious.  In  a similar  vein,  the  sudden  rise  of  the 
assessment  center  concept  has  shown  that  an  appropriate  criterion  In  this 
day  and  age  (in  employee  selection,  but  by  extension  to  the  problem  of 
treatment  selection  in  0.0.)  must  include  demonstrable  connection  between 
the  measures  used  arid  the  operations  or  functions  performed  in  the  real 
organization. 

These  facts  lead  us  to  the  following  preliminary  conclusions,  which 
form  a starting  point  for  the  research  to  be  undertaken  In  the  present 
report  and  in  the  reports  which  follow  it: 

. The  base  of  scientific  knowledge  which  undergirds 
organizational  development,  while  it  is  growing 
rapidly,  is  still  remarkably  small. 
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. Much  of  what  is  done  Is  based  upon  consultants' 
predilections  or  fads,  not  upon  solid  reasons 
diagnostically  generated. 

; There  is  as  yet  little  that  could  really  be 
termed  rigorous  diagnosis  practiced  within  the 
O.D.  profession. 

. Here,  as  elsewhere,  statistical  prediction  is 
likely  to  prove  far  more  accurate  than  clinical, 
or  clinically  mediated,  prodiction. 

. Raw  empiricism,  in  the  form  of  predictors  not 
obviously  related  to  the  processes  and  functions 
being  diagnosed,  no  matter  how  seemingly  accurate, 
are  no  longer  societally  acceptable.  Prediction 
must,  be  based  upon  measures  derivable  from  solid 
scientifi  c evidence  about  organi z.ati anal 
functioning. 

To  understand  what  is  or  must  be  Involved  In  diagnosis,  we  turn  to 
a field  which  has  practiced  and  taught  diagnosis  for  .years  and  decodes, 
or  even  centuries:  medicine.  Ledley  and  Lusted  (1959),  In  what  must 
be  counted  as  a seminal  article,  dealt  at  some  length  with  the  reasoning 
foundations  of  medical  diagnosis.  Table  1 presents  a few  of  the  principal 
points  which  they  make,  along  with  organizational  diagnostic  analogs. 

In  the  next  sections  we  present  a brief  discussion  of  the  content  of  each 
point. 
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The  total  pool  of  available  characteristics  (of  client  units)  is 
at  any  time  limited  to  those  which  our  knowledge  base  contains  some  infor- 
mation about  and  which  our*  measurement  methods  are  capable  of  measuring. 
All  available  characteristics  are,  at  some  level  on  their  respective 
scales,  potential  symptoms.  Whether  they  are,  in  fact,  regarded  as 
"symptoms"  or  not  depends  upon  what  past  research  and  experience  has  found 
to  be  true  --  that  is,  what  has  been  added  to  l:he  knowledge  base. 

What,  then,  are  diseases,  disorders,  or  slates  of  organizational 
dysfunction?  A disease  is  a hypothetical  construct  — a theoretical 
term  used  for  convenience  purposes  to  refer  to  a whole  chain  of  physical 
events  which  are  hypothesized  as  having  occurred.  "Proof"  that  the 
hypothesized  sequence  has  occurred  (or  is  occurring)  is  obtained  by  some 
form  of  validation  process.  This  validation  can  be  concurrent  or  even 

retrospective:  if  little  .lohnny  has  Influenza,  he  should  display 
particular  additional  characteristics  or  should  have  displayed  them  within 
the  last  24  to  48  hours.  It  can  also  take  the  form  of  construct 
validation,  that  is,  of  showing  that  only  those  observables  that  are 
hypothesized  as  going  together  In  fact  appear.  Finally,  the  validation 
process  can  be  predictive:  we  can  wait  to  see  whether  subsequent, 
predicted  signs  of  Influenza  appear  in  little  Johnny's  case.  Throughout 
this  sequence  of  comparisons,  however,  "Influenza"  is  a hypothetical 
sequence  of  events  which  we  presume  to  be  able  to  see  : pacific  signs 
of  at  specific  points  In  time.  Its  excellence  as  a classification 
category  at  any  given  point  in  the  profession's  development  Is  entirely 
dependent  upon  the  quality  and  completeness  of  the  knowledge  base  from 


which  we  w rk,  as  It,  relates  to  the  distinctions  between  this  category 
and  others. 
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What,  than,  determines  what  a disease  is?  It  Is  the  generalizatl  m 
and  codification  processes  wMch  past  knowledge  generators  have  gone 
through  In  Integrating  the  findings  from  research  and  experience. 

Diagnostic  procedures  which  rely  upon  "expert"  assignment  to  diagnostic 
categories  simply  substitute  the  expert  clinician  for  more  public  and 
repllcatable  listings.  If  the  experts'  procedures  are  unreliable,  their 
classification  is,  as  a criterion,  worthless.  If  they  are  reliable  ant> 
valid.  It  is  a valuable  aid  — a shortcut  to  employing  the  knowledge 
base  directly  and  In  Its  entirety. 

Regardless  of  the  way  in  which  we  mediate  the  process  by  which  the 
knowledge  base's  contents  get  represented,  the  disease,  disorder,  or 
dysfunction  Is  nothing  other  than  a string  of  symptoms  very  much  like 
those  which  we  look  at  in  any  particular  case.  It  is  to  this  hypothetical 
symptom  string  or  pattern  that  comparison  is  made  in  a diagnosis. 


lility  Statements 


In  organizational  development  and  change,  the  diagnostic  process 
follows  essentially  this  same  pattern.  Symptoms  are  organizational 


characteristics  which  past  research  Indicates  ::o  together  to  define  sane 
more  general  statement  of  organizational  health  or  (dysfunction.  That 


our  "diseases"  not  have  exotic  names  in  Latin  should  not  dismay  us. 


Perhaps  the  absence  of  names  at  all  is  an  advantage,  in  fact.  Certainly 
there  have  been  fewer  year.,  and  resources  available  as  yet  for  the 
codification  of  the  knowledge  base,  and  our  professional  schools  teach 
us  to  be  hesitant,  cautious,  and  qualifying  in  ou>  statements,  rather 


than  authoritative,  definitive,  and  final.  These  are  Issues  of  style, 
however,  rather  than  substance.  The  fact  remains  that  there  is  an 
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existing  knowledge  base,  comparison  to  which  permits  us  to  make  a 
probability  statement  concerning  any  case  at  hand.  Here,  as  In  medicine, 
a diagnostic  statement  Is  a "best  guess." 

Relevance  to  Treatment  Selection 

The  whole  purpose  of  a diagnosis  is  to  permit  the  selection  of  an 
optimum  treatment  or  intervention.  Here,  as  in  the  case  of  medicine, 
such  choices  are  subject  to  social,  ethl cal , economic,  and  moral 
constraints  imposed  by  the  society  in  which  we  live.  Certain  interventions 
may  be  socially  unacceptable  or  even  morally  offensive.  For  example. 
Intensely  confrontational  techniques  are  clearly  unacceptable  in  many 

more  traditional  organizational  settings,  and  under  certain  circum- 
stances It  is  conceivable  that  top  management  team  development  training 
might  generate  an  In-group  clubbiness  whose  effects  are  racially 
discriminatory  and  therefore  morally  offensive.  Other  interventions, 
no  matter  how  appropriate  and  promising,  might  be  so  expensive  as  to  be 
prohibitive,  while  still  others  that  would  solve  the  problem  might 
lead  to  violations  of  privacy  and  confidentiality  which  must  be  judged 
to  be  unethical. 

However,  within  the  limits  which  these  constraints  Impose,  the 
problem  bacomas  one  of  selecting  an  optimum  treatment  from  a pool  of 
those  available.  What  is  optimum?  Led  ley  and  Lusted  (1959)  turn  to 
value  (decision)  theory  In  an  attempt  to  answer  tills  question.  Bowers 
and  Hausser  (1977)  have  shown  how  the  organizational  development  problem 
can  Itself  be  cast  Into  these  same  terms,  and  have  presented  empirical 
evidence  about  a limited  number  of  Intervention  strategies. 


uIjV  h-.fc. 
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A diagnostic  prooadure  which  clearly  differentiates  cases  to  which 
each  of  the  known  and  available  interventions  are  appropriate  would 
obviously  be  superior  to  one  which,  in  some  measure  or  other,  was  unable 
to  distinguish  a condition  calling  for  one  intervention  from  a condition 
calling  for  another.  At  the  most  undesirable  extreme  would  be  a 
"til agnostt c"  procedure  whose  conclusions  lead  always  to  the  same 
treatment  or  Intervention,  a condition  which  Levinson  (1972)  implied 
occurs  in  organizational  development  all  too  frequently. 


Role  of  the  Knowledge  Base 

Even  wi  th  a relatively  simple  ra-lng  system  of  "Yes-No"  or 
"Present- Absent,"  a list  of  N possible  characteristi cs  produces 
potential  combinations.  The  number  of  potential  "diseases"  or  dysfunc- 
tional states  --  represented  by  the  number  of  cells  in  an  N-dimenslorial 
lattice  --  Is  obviously  unreal  istically  large.  In  any  comprehensive 
schema,  all  of  the  aval  table  units  in  the  world  probably  would  be 
Insufficient  to  providing  a single  case  per  cell.  The  equally  obvious 
conclusion  Is  that  most  cells  are  empty,  that  they  represent  nonexistent 
disorders,  and  that  only  a relative  few  comprise  the  set  of  "real" 
possibilities.  It  Is  the  task  of  the  knwiedge  base  to  provide  us  with 
current,  accurate  Information  about  what  these  possibilities  are. 

Much  the  same  point  is  made  in  the  theory  of  adaptation  in 
natural  and  artlflcia1  systems  (Holland,  1975).  Combinations  of 
characteristics  tap  idly  generate  astronomical  numbers  of  possibly 
adaptive  structures.  If  the  organism  or  system  were  to  choose  an 
enumeratlve  adaptive  plan  --  simply  running  down  the  list  randomly 
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until  It  found  the  one  that  worked  — adaptation  would  rapidly  become 
Impossible.  As  the  writer  just  cited  Indicates,  given  even  the  fastest 
computers  in  existence.  It  would  require  "a  time  vastly  exceeding  the  age  of 
the  universe  to  test  101®0  structures."  Instead,  adaptive  plans  to  b« 
feasible  must  be  robust  — that  is,  they  must  be  efficient  over  the  range 
of  situations  which  will  be  encountered.  One  general  requirement, 
therefore.  Is  that  an  adaptive  plan  must  retain  advances  already  made, 
as  well  as  parts  of  the  history  of  what,  has  occurred.  This  Information, 
of  course.  Is  what  constitutes  the  knowledge  base  In  any  diagnostic  system. 


Improvement  Probabilities  and  the  Single  Case 

At  first  blush,  the  statement  seems  unfeeling  or  insensitive  that 
we  maximize  the  probability  of  any  Individual  unit's  showing  Improvement 
when  we  apply  to  It  a strategy  shown  to  maximize  the  number  of  uni ts 
showing  Improvement.  Organizational  development  is,  after  all,  a human 
practice  profession,  and  It  seems  Impossible  to  Ignore  facts  obviously  at 
hand  (within  range  of  our  personal  observations,  for  example). 

Nevertheless,  observations  based  upon  an  N of  one  (observer), 
collected  under  atypical  conditions  and  within  non rep resent a tl vely  short 
time  frames,  are  no  more  reliable  and  accurate  when  taken  singly  than 
they  would  be  If  used  en  masse  for  large  numbers  of  cases. 

This  Issue  was  touched  upon  by  one  of  the  present  writers  In  an 
earlier  report:  "F.ven  the  most  accurate  diagnosis  may  suffer  from  mid- 
stream or  horseback  revisions  made  by  the  consultant  as  he  approaches  Its 
use.  Basically,  any  data  collection  and  analysis  method  treats  with  some 
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<feg***  of  eore  and  accuracy  a portion,  but  not  all,  of  the  behaviors, 
outmte,  and  Issues  in  the  life  space  of  the  client  system.  Some  portion 
1-s  urrlqtj*  to  that  system,  ov  to  any  group  within  It,  or  will  have  been 
excluded  from  the  array  of  information  categories  desired  in  the  diagnostic 
process  at  Its  inception.  As  the  consultant  approaches  a particular  urit 
or  group  of  the  client  system,  he  will  necessarily  see  other  aspects  of 
whet  he  feels  are  Its  functioning  not  represented  in  the  diagnosis  which 
he  h«s  in  hand.  Since  he  Is  dealing  with  a real  client.  In  a real  world 
situation,  the  tenptatlon  Is  well  nigh  irresistable  to  revise  the  diagnosis 
on  the  basis  of  his  current  observation.  Yet,  he  is  one  observer  observing 
at  hast  a limited  and  time-bound  behavior  sample.  To  the  extent  that 
he  mokes  such  revisions  he  therefore  very  likely  reduces  both  the 
reliability  and  the  validity  of  the  diagnosis  with  which  he  works.  Said 
otherwise,  he  approaches  each  group,  or  each  setting,  a-  a unique  Instance 
with  live  people  and  real  problems.  Yet  In  many  ways  the  dlagnos.s  and 
treatment  problem  in  organizational  development  is  a “large  N"  problem. 

he  to  work  on  the  basis  of  the  diagnostic  data  provided  to  11m  and 
that  alone,  given  that  it  is  reliable  and  valid,  he  would,  across  a large 
number  of  cases,  succeed  In  a high  portion  (assuming  that  the  diagnostic 
and  prescription  processes  are  themselves  high  In  quality,  reliable,  and 
valid).  Y^t  he  does  not  ordinarily  approach  his  role  with  that  degree 
of  abjective  detachment,  and  each  tlm#  that  he  yields  to  the  temptation 
to  revise  on  th*  basis  of  "current  reality"  he  submits  himself  to  a situation 
In  which  his  action  steps  are  based  on  laws  than  acceptably  reliable  and 
valid  data."  (Bowers,  1974) 
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Toward  Relevant  Research 

Clearly,  therefore,  any  attempt  to  develop  and  test  more  rigorous 
diagnostic  procedures  In  organisational  development  should  be  based  upon 
a model  containing  principles  of  organizational  functioning.  In  other 
words.  It  should  be  theoretically  anchored  to  a conceptual  statement  ihat 
Is  Itself  both  organizational  In  content  and  comprehensive  in  scope. 

While  the  literature  on  organizational  management  Is  ripe  with  theoretical 
statements,  most  of  them  do  not  meet  criteria  of  acceptability  for  our 
present  purposes.  Many  must  be  dismissed  as  less  comprehensive  than  Is 
necessary  for  the  present  problem:  that  Is,  they  are  elegant  treatments 
of  an  isolated  issue  such  as  job  design  or  individual  satisfaction  or 
leadership,  but  they  Ignore  other  areas.  Others  may  be  rejected  because, 
although  they  encompass  most  of  the  domain,  they  are  lacking  in  adequate 
empirical  underpinnings.  Howe'er,  one  theoretical  statenent  which  does 
appear  to  meet  the  criteria  Just  outlined  Is  the  Likert  meta-theoretl cal 
paradigm.  (Likert,  1961,  1967,  1976;  Bowers,  1976)  It  Is  this  theoretical 
statement  which  underlies  the  measures  collected  In  the  data  bank  to  be 
used  In  the  research  launched  by  this  project. 

Most  recent  evidence  suggests  that  this  paradigm  assumes  the  form 
taken  in  Figure  1 (Bowers  4 Franklin,  1977).  As  o set  of  principles,  this 
paradigm  would  appear  to  satisfy  the  criteria  of  conv>rehensive$  »nd 
evidential  validity.  (Bowers  & Franklin,  1977;  Likert*  1977).  It  Is 


operationalized  here  In  the  form  of  the  Surv 


Ions,  a machlne- 


s cored  standardized  questionnaire  which  has  been  used  In  various  editions 
since  ^966  to  collect  organizational  survey  data  for  assessment,  feedback, 
and  bet  chmark  purposes  (Teylor  A Bowers,  1972).  Portions  of  these 


« i Vi,  nil  rraw i r»i  inieiilrf  Tiiaaie<wMaiiei'iin'».tvi 


17 


banked  data  have  been  used  In  earlier  research  efforts  related  to 
organizational  development.  In  this  regard*  a method  of  diagnostic 
classification  was  previously  developed  and  preliminarily  tested. 

Termed  CANOPUS , It  contains  a software  package  designed  to  generate  a 
diagnostic  state  ment  for  groups  and  pyramids  of  groups  comprising 
organizations  (Bowers,  1974).  This  classification  method  is  based  upon 
a typology  of  work  groups  developed  In  the  cour:  e of  prior  research. 

The  technique  used  for  the  development  of  the  typology  was  profile 
analysis,  in  which  one  arrives  at  a clustering  of  work  groups.  The 
profile  consisted  of  a group's  scores  on  the  SOO  Indexes  and  as  a 
profile  reflected  three  basic  kinds  of  Information:  level,  dispersion, 
and  shape.  Level  was  the  mean  score  of  the  work  group  over  the  indexes 
In  the  profile;  dispersion  reflected  hew  widely  scores  In  the  profile 
diverged  from  the  average;  and  shape  concerned  the  profile's  high  and 
low  points. 

A measure  of  profile  similarity  that  takes  shape,  level  and  dis- 
persion Into  account  is  the  distance  measure.  If  one  considers  a person 
(or  group)  as  a point  In  a multidimensional  space  In  which  each  dimension 
represents  a variable  or  index,  then  the*  distance  between  two  points, 
that  Is,  persons  (or  groups),  can  be  computed  using  the  generalized 
Pythagorean  theorem.  The  distances  can  then  be  examinee!  to  determine 
whU;h  groups  cluster  together  in  that  multidimensional  spac«\ 

The  clustering  technique,  called  hierarchical  grouping,  uses  this 
distance  measure  as  a measure  of  profile  similarity,  ronputer  software 
Is  available  for  this  technique  In  the  program,  HGROUP  (Veldman,  1967.) 
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Tfrfs  p*e#)nMP  begins  by  considering  each  original  object.  In  this  case 
* rnrk  grdkP.  of  these  to  be  clustered,  ?<s  a cluster.  These  h clusters 
wen  the*  reduces!  In  number  by  a series  of  step  decisions  until  all  N 
objects  how*  becm  classified  Into  one  or  the  other  of  two  clusters. 

At  e#«h  step  tha  nuntoer  of  clusters  Is  reduced  by  one  through  combining 
some  pair  of  cluster*.  The  particular  pair  to  be  combined  at  any  step 
H determined  by  the  computer's  examining  all  the  available  combinations 
and  choosing  the  one  which  minimally  Increases  the  total  variance  within 
clusters.  It  Is  this  latter  minimizing  function  that  utilizes  the  dis- 
tance notion.  The  total  variance  within  cluster's  Is  a measure  of  the 
closeness  of  the  points  In  multivariate  spate  In  clusters  already 
chosen,  A substantial  increase  in  this  variance,  which  the  H GROUP 
prwgyiW*  l*b«U  an  error  term.  Indicates  that  the  previous  number  of  cluster's 
ii  probably  optimal  for  the  original  set  of  objects  or  work  groups, 
fhe  program  provides  an  Identification  of  those  groups  contained  In  each 
cluster  so  that  further  analyses  can  bo  conducted  on  phenomena  within 
clusters. 

The  HfifKHA*  program  was  applied  to  three  random  subs ample*  drawn 
from  the  debt  bank  (Hawser  & Bowers,  1977.)  When  the  three  sets  of 
data  were  considered  jointly,  a total  of  17  distinct  profiles  emerged. 

In  many  ways . this  software  system  would  appear  to  meet  generally  the 
requirement*  listed: 

. It  coepom  data  to  appropriate  norm. 

. Problem*  once  identified  are  prioritized  in  terms  of 
their  potential  impact  upon  outccws. 

, It  seeks  com&o*  for  observed  conditions  among  situa- 
tional, information,  skill  and  values  conflict 
predictors , employing  a distance  statistic. 
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. It  selects  a broad  set  of  potentially  appropriate 
action  steps  from  an  array  of  possibilities. 

I!  iitMVril’.  I.lii'  whole  mid  ll‘.  p.irl.1.  into  .<  n*.ulnhir 
narrative  by  computerized  text-writing. 

Still,  the  method  Is  based  upon  the  measures,  and  those  measures 
derive  from  the  theoretical  paradigm  previously  cited.  While  attractive, 
it  Is  but  one  of  sevei  al  statements  that  might  have  formed  the  basis  for 
operations  and  measures.  Clearly  some  difference  among  theorists  Is  to 
be  expected.  The  domain  is  sliced  differently,  and  the  terms  applied  to 
collections  or  clusters  of  behaviors  and  processes  will  vary  substantially. 
However,  if  the  fundamental,  general  algorithm  is  the  same,  we  can  at 
least  be  somewhat  reassured  that  subsequent  work  will  not  be  unacceptably 
parochial . 

In  an  effort  to  address  this  question,  the  writings  of  nearly  30 
prominent,  persons  1 n the  field  (Including  Likert)  were  examined.  While 
no  effort  was  made  to  consider  all  possible  positions  taken  by  every 
conceivable  writer,  an  effort  was  made  toward  at  least  reasonable 
comprehensiveness.  All  of  those  considered  were  concerned  In  one  way  or 
another  with  organizations,  and  all  had  demonstrable  action  Interests. 

Some  were  discarded  after  a brief  scrutiny  for  the  reason  that  their 
Interest  appeared  to  be  nar-organl  rati  onal , that  Is,  that,  for  example, 
the  outcome  valued  was  personal  growth  or  individual  adjustment,  not 
organizational  effectiveness-  Others  were  discarded  because  their  writings 
were  restricted  almost  totally  to  the  operation  of  a particular  technique 
in  a limited  environment  (management  by  objectives,  for  ^xanple) . In 
the  end,  more  careful  consideration  was  given  to  the  writings  of  11 
theorists  or  pairs  of  theorists.  (See  Table  2.) 


22 


I 


11 


r 


v | 


m 


At  first  blush,  the  position's  represented  by  these  persons  world 
Appear  to  be  diverse.  Merzberq,  after  all,  is  primarily  oriented  toward 
job  enrichment,  while  McGregor's  position  was  one  focused  around  the 
behavioral  effect  of  personal  values.  However,  a closer  look  reveals  that, 
with  minor  variations,  the  same  general  algorithm  underlies  most,  if  not 
all.  While  It  varies  greatly  in  form,  and  considerably  in  the  exact 
i ferns  noted,  recorded,  or  counted,  it  assumes  that  something  external 
to  the  organism  (group,  group  member,  br  organizational  unit),  that  is, 
something  in  its  environment,  combines  or  Interacts  with  something 

I 

internal  to  the  organism.  This  process  leads  to  states  of  internal 
functioning  on  the  part  of  the  organism  that  in  turn  result  In  effectiveness. 
Stated  at  a highly  general  level.  It  can  be  seen  as  an  expression  of  the 
old  Lewi ni an  equation,  B = f(P*E). 

Some  dlffeiance  among  theorists  obviously  occurs  around  the  issue 
of  aggregation  of  Individuals  Into  collectivities  such  as  groups.  Some 
(as,  for  examr  le9  Lavfler  or  Herzbeeg)  treat  individuals  separately  and 
integrate  afterward,  sometimes  by  an  implicit  summation.  Others  (for 
example,  Lawrence  & Lorsch  or  Likert)  aggregate  first  and  deal  principally 
with  group-level  processes  and  results.  In  the  Ugh'  of  the  similarity 
of  the  general  algorithm,  however,  this  difference  seems  comparatively 
minoi , 

Let  us  consider  in  greater  detail  each  of  the  components,  environ- 
ment and  person.  Turning  first  to  the  environment  segment.,  words,  terms 
and  phrases  vary  widely,  but  nearly  all  of  the  theorists  concerned  seem 
to  focus  upon  the  following  set  of  interrelated  Issues: 
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' . information  flows,  proces  es,  and  patterns 

• motivational  conditions,  structures,  and  systems 

' 

. task  configurations,  structures,  or  flows 

• norms , values,  atti tudes  , and  beliefs  stemming  from 
superordinate  governance  systems 

- . distribution  of  power t particularly  for  resource 

allocation 

. technical  or  physical  conditions,  or  configurations 
\ ***— *— 

of  objects 

The  person  portion  of  the  algorithm  is  variously  represented  in  the 
different  theoretical  treatments , but  appears  to  deal  in  the  following: 

• an  af fecti ve  component , in  the  form  of  values,  needs, 
or  personality  orientation 

• a cogni tive  component.  In  the  form  of  knowledge, 
ability,  information,  expertise,  and  expectations. 

. a behavioral  potential  component,  in  the  form  of 
competence  and  skills. 

In  some  Instances,  important  modifications  of  one  or  more  of  these 
are  assumed  to  be  represented  by  such  personal  background  descriptors  as 
age  or  education.  Even  where  this  Is  so,  however,  it  is  clear  that  It 
Is  the  affective,  cognitive,  and  behavioral  Implications,  rather  than 
sheer  demographic  facts,  that  are  field  to  be  Important. 

Thus  the  algorithm  might  be  restated  in  somewhat  more  elaborate 
fashion  as  it  appears  In  Figure  2.  The  problem,  of  course,  becomes 
more  complicated  when  extended  to  those  groups  of  groups  called  whole 
systems.  Outputs  from  one  group  become  environmental  situations  forming 
inputs  to  other  groups,  (Bowers  & Spencer,  1977). 


Fi§»re  2 
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Nevertheless , we  come  away  from  this  examination  of  the  field 
reasonably  confident  that  a common  algorithm  underlies  most  of  the  major 
works  In  It,  It  leaves  us  reassured  that  adherence  to  an  alternative 
formulation  from  this  list,  if  pursued  to  Its  most  basic  form,  would  not 
result  in  an  utterly  different  diagnostic  scheme.  Terms  might  be 
different,  and  the  operations  employed  by  each  writer  to  measure 
particular  sets  of  variables  might  vary  widely,  but  the  rationale  and  the 
set  of  primitive  constructs  would  be  very  much  the  same. 


A Stock-Taking  and  Some  Impli catlo.  s_ 

Against  the  expressed  need  for  Improved  methods  for  diagnosis  in 
organizational  development  we  can  array  the  following  major  points  from 
the  preceding  discussion.  An  adequate  diagnostic  procedure  necessitates: 

(1)  A theoretical  model  which  is  acceptably  comprehensive, 
which  shares  the  same  general  algorithm  present  in  the 
array  of  principal  alternative  formulations. 

(2)  A bank  of  data,  collected  by  a standardized  Instrument 
In  a wide  variety  of  organizational  settings,  both 
military  and  civilian. 

(3)  A recognition  that  accuracy  In  diagnosis  will,  here  as 
elsewhere,  very  likely  be  enhanced  by  statistical  opera- 
tions rather  than  clinical  Judgment. 

(4)  An  acknowledgment  of  societal  requirements  rejecting  "raw" 
empiricism  In  favor  of  statistical  procedures  and  measures 
which  are  content  valid. 
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ddirftarlng  the  magnltuda  of  the  problem,  the  size  of  data  sets, 
ifhd  fife  turnaround  time  requirements  present  n most  organization  (fevc’lop- 
mint  situations,  yet  another  requirement  wou1 1 appear  to  be  present: 
tffftt  whatever  operations  result  be  computer-assisted.  In  this  area,  one 
ttdft  profit  from  the  experience  of  another  practice-oriented  profession 
whgfcl  researchers  have  explored  computerized  diagnostic  procedures, 
ptyehlitry. 

There « e»  elsewhere,  substantial  difference  of  opinion  exists 
contittmlny  the  best  method  of  evaluating  the  Inportence  of  symptoms. 

T'Jiti  yftnerai  types  of  models  relying  upon  probability  statistics  have 
MOn  proposed: 

(1)  A discriminant  function  model.  In  which  each  symptom  Is  given 
«n  empirically  derived  weight,  and  an  artificial  measure  Is 
then  obtained  as  the  sum  of  the  weighted  values  (Crooks, 

Wumy  I Meyne,  1959). 

(?)  A Bayes  I an  or  frequency  -count  model.  In  which  the  relative 
frequency  of  occurrence  of  each  possible  symptom-dlseaae 
pattern  is  considered  (Ledley  l Ltistod,  VT59) . 

To  these  ties  been  added  a third  method: 

(1)  Treats  the  Issue  os  a decision*- tree  problem, " thus  relying 
taiMmnlly  upon  excellence  of  the  knowledge  base  end  not  at 
all  upw  probabilities  In  a develepaental  sample  (Spltzar 
t endicott,  1968) . 
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The  Thrtg  Methods  In  Detail 

Multiple  Discriminant  Method:  This  method  uses  a large  developmental 
sample  of  previously  diagnosed  cases  and  is  especially  suited  for 
quantitative  data.  It  takes  account  of  correlations  among  the  measures  and 
finds  for  each  clinical  group  In  the  developmental  sample  a set  of  weights 

» 

derived  so  as  to  maximize  each  subject’s  likelihood  of  being  assigned  to 
the  group  from  which  he  came.  In  subsequent  use,  each  new  case  is  assigned 
to  the  group  fpr  which  the  composite  score  represents  the  least  distance. 
The  discriminant  function  model  uses  a diagnostic  equation  of  the 
following  type: 

Y ■ aXi  + bX2  + cX3  + ...  + px^ 

where  the  coefficients  a,  b,  c,  ...  , are  weights  proportional  to  the 
magnitude  of  the  relationship  between  the  particular  symptom  and  the 
disease  process,  reduced  according  to  the  relationship  of  this  symptom 
to  tho  others  represented  in  the  equation,  and  Xlt  Xtl  etc.  are  the 
observed  symptom  values. 

\ 

The  arguments  in  favor  of  a multiple  discriminant  model  are  at  least 
threefold: 

(1)  It  better  replicates  the  thought  process  employed  by  the 
human  diagnostician,  who  does  not  treat  each  symptom  In 
present/ absent  fashion,  but  rather  attaches  greater  or  less 
weight  to  each  symptom  according  to  his  past  experience, 

(1 .e. , his  own  version  of  the  knowledge  base.) 

(2)  Symptoms  which  correlate  highly  with  the  presence/absence  of 
a disease  are  given  more  weight  than  those  that  have  shown 
little  or  no  correlation  to  Its  presence/ absence. 
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(3)  Appropriate  weighting  also  depends  upon  a symptom's  correlation 
with  other  symptoms.  If  the  overlap  is  high,  then  one  would 
weight  the  second  symptom  much  lower  than  would  be  the  case 
if  the  two  symptoms  have  little  relationship  to  each  other. 

At  least  two  objections  have  been  raised  to  this  method: 

0)  It  relies  upon  the  accumulation  of  a large  developmental 
temple  of  cases,  which  Is  difficult,  expensive,  and  In  roost 
instance*  unlikely. 

(2)  It  capitalizes  upon  accidental  features  of  the  developmental 
u ample  and  thus  gives  an  Inflated  estimate  of  Its  accuracy. 

If  the  valldatlun  sample  comrs  from  a somewhat  different 
population,  the  drop  In  efficacy  Is  even  greater. 

Still,  the  method  has  been  explored  and  developed  (Rao  I Slater, 

1948;  Melrose,  et  el.,  1970;  Sletten,  et  el.,  1971).  It  Is  closely  aligned, 
although  not  Identical,  to  the  method  employed  by  our  own  effort  at 
computerized  organizational  dlagnsQs,  which  employs  the  distance  statistic 
(Q)  In  a hierarchical  grouping  procedure. 

on  Method:  This  method  also  uses  a large  develop- 
mental sample  of  previously  diagnosed  cases  to  determine,  for  each  diagnostic 
category,  the  relative  frequency  of  each  possible  profile  of  scores. 

Its  fundamental  diagnostic  formula  Is; 

E — fr(Q  — *f) 

whore  & 1|  scientific  knowledge;  £ Is  a complex  of  syiqptom;  and  f is  a 
complex  of  diseases.  It  rosy  be  read,  "Existing  scientific  knowledge  E 
Implies  that,  If  symptom  complex  £ is  In  evidence,  the  patient  very  probably 
has  disease  f." 
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The  knowledge  best  Is  thus  stated  as  a series  of  conditional  probabili- 
ties, os  the  form  P {G  j f),  l.e.,  the  probability  of  a patient's  displaying 
the  symptoms,  given  that  the  disease  Is  present,.  It  may  be  ''converted"  to 
the  more  diagnostically  useful  form  P (f  | G)  — the  probability  of  a 
patient's  having  the  disease,  given  that  the  symptoms  are  present  — by 
Bayea'  formula.  By  this  formula,  the  probability  of  a patient's  having 
a particular  disease,  given  that  he  has  a particular  complex  of  symptoms. 

Is  equal  to: 

the  probability  of  the  disease's  occurring  In 
the  population  at  all,  multiplied  by 
the  probability  that  a patient  will  display 
this  particular  complex  of  symptoms,  given  that 
the  disease  Is  present,  and  this  result  divided 
by 

. the  summation  over  all  disease  complexes  of  the 
probability  of  the  disease  complex  multiplied  by 
the  probability  that  a patient  will  display  this 
particular  conplex  of  symptoms,  given  that  (each 
Individual)  disease  complex  Is  present. 

Thus  knowledge  base  statistics  and  symptomatic  Information  from  the 
current  case  can  be  combined  .0  establish  the  needed  diagnostic  probability. 
A subject  with  a given  profile  is  then  assigned  to  the  group  for  which 
his  profile  Is  relatively  the  most  frequent. 

The  method  Is  f. peel  ally  suited  for  categorical  or  nominal  data 
expressed  In  mutually  exclusive  categories.  It  can  also  be  appied  to 
numeric  data  by  grouping  scores  Into  Intervals  and  treating  each  Interval 
as  a qualitative  category.  Ignoring  both  Its  original  quantitative  value 
and  Its  ordinal  position. 
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The  prlndpel  argument  raised  In  favor  of  a Bayesian  approach  to 
coifutarl zAd  diagnosis  appears  'o  be  that  It  also  Is  claimed  to  model  the 
bmm  judgment  process  by  which  symptoms  are  converted  Into  a diagnostic 

statement  (that  Is,  that  th«  physician,  for  axavple,  amp  lays  a conditional 
probability  Judgment  process  In  arriving  at  a diagnosis.) 

The  objections  are  a bit  more  extensive: 

(1)  It  is  difficult.  If  not  Impossible,  In  diagnostic  work  to 
satisfy  the  conditional  independence  requirement  (the  require- 
ment that  the  probability  of  finding  one  particular  symptom 
given  that  the  disease  Is  present,  Is  unaffected  by  the 
presence  or  absence  of  any  other  symptom.) 

(2)  As  In  the  other  statistical  method.  It  requires  the 
accumulation  of  a large  developmental  sample  of  appropriate 
form  and  content. 

(3)  The  necessary  assumption  that  the  diseases  are  mutually 
exclusive  may  not  hold. 

<4)  As  In  the  other  statistical  iwthod,  It  capitalizes  upon 
accidental  features  of  the  developmental  sample. 

Regardless  of  these  abjections,  the  method  has  been  explored,  developed, 
and  experimentally  Implemented  in  psychiatry  (filmbaun  & Maxwell,  I960, 
Overall  t ftarhew,  1963;  Serith,  1^66). 

0ec*j1<*i»Tna«  Wha  logical  decision  tree  method  starts  with  n 

series  of  paaatiens , each  of  which  Is  treated  as  true  or  false.  The  true 
or  falee  mptxtse  to  each  question  rules  out  one  cr  more  diagnostic 
possibilities  ami  determine*  which  question  is  to  be  answered  next.  The 
questions  may  specify  the  pneaowa  of  a single  symptom,  the  existence  of 
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a nuneric  score  In  a certain  range,  or  a complex  pattern  of  symptoms  and 
scores.  The  method  ordinarily  results  in  the  subject's  being  assign*;  1 to 
a single  diagnostic  category,  with  no  quantitative  measure  of  similarity 
to  either  that  category  or  other  categories. 

The  arguments  In  fivor  of  tills  method  are  given  by  at  least  one 
proponent  (Spltzer,  et  al.,  1974)  as  the  following; 

(1)  It  1$  Independent  of  any  specific  body  of  data;  thttt  Is,  It 
does  not  require  a large  developmental  sample. 

(2)  It  Is  not  constructed  so  as  to  be  optimal  for  any  one 
population  ?/id  for  this  reason  "travels  well"  from  one  setting 
to  another. 

(3)  As  In  the  case  of  each  of  the  other  two  methods,  it  is  thought 
to  represent  optimally  the  thought  processes  of  the  human 
diagnostician. 

The  objections  are  the  following; 

(I)  It  Is  quite  dependent  upon  tire  accuracy  of  the  theory  which 
underlies  the  decision  * ree  and  is  therefore  ultimately  as 
dependent  as  the  other  methods  upon  past  data  accumulations, 
their  care  and  form. 

U)  Its  general  Uabl  11  ty  may  be  more  apparent  than  real. 

(3)  Its  assumption  that  the  diseases  . re  mutually  exclusive 
may  not  hold. 

Approaches  following  this  method  have  been  developed  and  applied  by 
a«w*r*l  Investigators,  with  a variety  of  results  (Spltzor  & F.nd1cott,  119613; 
Ming,  1970). 
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Save  tl  efforts  have  bean  muiertfikken  In  psychiatry  to  coop  am  two  or 
mmm*  of  that#  mthAMte  emplrlcilly.  The  results*  are  bast  described  as 
dtotfdedly  unclear.  Overall  and  Hollister  (1964)  conducted  on*  such  com- 
perUen,  but,  unfortunately , the  rulat  used  by  their  computer  programs 
Mil  obtained  from  diagnostic  stereotypes  provided  by  experts,  rather  than 
from  observed  characteristics  of  actual  cases. 

Mblrese,  et  el.  (1970)  compared  a multiple  discriminant  with  o 
decision- tree  approach  and  found  that:  (a)  on  single  assignments  the 
decision*- tree  approach  showed  a greater  degree  of  agreanent  with  an  expert 
judgment  criterion;  (o)  If  first,  second,  or  third  pussiblv  as;.lyiiineoh, 
were  al  lowed,  the  multiple  discriminant  method  showed  a greater  degree  of 
aynHHWfflt  than  did  the  dedal  on- tree;  and  (c)  In  ary  event,  each  method 
performed  better  for  certain  diagnostic  categories. 

Finally,  Fltlss,  et  al.  (1972)  unrpared  all  three  methods  and  found 
none  of  the  three  to  b.»  clearly  superior  to  the  other  two.  Again,  however, 
the  criterion  wes  agreement  with  expert  diagnoses,  a criterion  whose 
unreliability  the  authors  duly  note. 

fl-Pj 

The  work  from  psychiatry,  Just  cit*dv  contains  a dilemma  whose  existence 
UMestlons  the  whole  body  of  findings  and  whose  resolution  might  be  seen 
es  remtaiing  the  whole  exercise  rather  trivial.  Elegant,  repll eatable, 
and  readily  transported to  methods  ere  rfculgned  and  tested  against  * 
criterion  <rf  "Judgment  by  expert  clinicians."  Yet  here,  m elsewhere,  the 
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Cronbach  warnings  apply:  expert  clinical  judgment  is  notoriously  unreliable. 
What  has  been  developed,  therefore,  are  three  elegant  ways  of  replicating 
an  unreliable  procedure.  On  the  other  hand,  had  a reliable,  repll eatable , 
transportable  procedure  existed  for  use  as  a criterion,  It  would  no  doubt 
have  been  more  sensible  to  employ  It  as  the  diagnostic  method,  rather  than 
as  « criterion  for  other  methods. 

Sevoral  implications  stem  from  this  observation.  First,  where  In 
psychiatry  expert  clinical  judgment  is  an  unreliable  classification  method 
and  criterion,  in  the  present  Instance  we  do  possess  a reliable, 
verifiable  procedure,  one  based  upon  the  distance  statistic.  Obviously, 
if  accuracy  Is  Interpreted  in  distance  terms,  no  alternative  procedure 
can  be  as  accurate  as  that  self  same  distance  statistic.  If,  therefore, 
this  were  the  sole  or  major  Issue  to  be  researched,  the  project  would 
»,id  Immediately. 

It  is  not,  however;  other  Issues  of  equal  or  greater  Importance  occur, 
the  answers  to  which  are  In  no  way  obvious.  These  issues,  tc  be  explored 
In  the  sections  which  follow.  Include:  (a)  proportion  of  correct 
classification,  (b)  typology  reproduction,  (c)  weighted  dimensional 
distances,  (d)  zero-one  counts,  and  (e)  information  reduction.  Separate 
from  U>1s  Is  the  whole  issue  of  a declsl on-tree  (with  or  without 
Bayesian  inputs),  and  the  non-subs tanti ve  matters  of  efficiency,  cost, 
and  ease,  "''hese  latter  issues  also  are  dealt,  with, 
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BASIC  PREPARATION  OF  THE  DATA 
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The  purpose  of  this  section  is  twofold:  to  describe  the  data  set 
to  be  used  for  the  remainder  of  this  study;  and  to  describe  the  four 
classification  techniques  (distance  statistic,  multiple  discriminant 
furtctton,  Bayes,  and  decision  tree)  ns  they  will  be  used  in  the  research 
to  be  t ubs*qu«rttly  reported. 


The  existing  national  (civilian)  normative  file  of  the  Survey  of 
Organizations  ($00)  contains  5,599  groups.  It  represents  the  total  body  of 

data  collected  since  1966  from  some  37,090  persons  In  a broad  segment  of 
the  civilian  Industrial  population.  As  such,  it  represents  many  different 
industries,  functions,  and  hierarchical  levels. 

Available  also  are  data  from  two  independent  military  samples. 

The  first  of  these  contains  more  than  200  usable  groups  of  Navytnen 
from  whom  questionnaire  data  on  S00  indexes  were  collected  In  late  1972 
and  early  1973.  The  second  contains  320  groups  of  Army  soldiers  from 
whom  data  were  collected  in  late  1974  and  early  1975.  In  each  of  these 
Instances*  in  order  to  satisfy  the  need  for  intact  units,  it  was  decided 
to  collect  data  from  all  members  of  a selected  nutrtber  of  organizational 
subunits  Or  "modules."  These  modules  consisted  of  n pyramid  of  work 
groups  three  echelons,  or  tiers,  tall.  Thus  data  were  collected  from  all 
rtSflWttmrs  of  the  three  organizational  levels  immediately  below  a designated 
"module  head."  Modules  themselves  were  selected  by  what  amounts  to  a 
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stratified  random  sampling  procedures.  Methods  are  spelled  out  in  greater 

detail  in  two  technical  reports  (Mlchaelsen,  1973;  Spencer,  1975.) 

Taken  together,  these  various  data  sets  comprise  a sample  of  6,119 

groups,  to  be  employed  in  the  main  analyses.  For  other  analyses,  per- 
formance measures  truty  also  he  employed  for  a subset  containing  940  groups 
for  which  measures  of  efficiency  and/or  attendance  are  also  available. 

The  measures  themselves  have  been  examined  extensively  In  the  course  of 
another  project  and  their  properties  reported  (Pecorella  & Bowers,  1976a; 
1976b.)  They  have  been  converted  to  standardized  score  form  to  attain 
Inter-organizational  coinparabl lity. 

From  the  onset  of  the  project  to  the  present  time,  these  various 
data  sets  have  been  reformated  so  that  all  sha»*e  a common  format.  All 
data  have  been  entered  into  a single  large  file  with  merged  superordinate 
values  (those  from  the  yroup  Immediately  above),  plus  with  merged  values 
for  a second  wave  of  survey  data  where  such  a second  wave  exists. 


Measures  Used 


Organizations  contains  in  its  19/4  edition  16  standard 


indexes.  Two  of  these,  bexausu  they  have  not  been  universally  used  since 
the  start,  of  the  data  bank,  will  be  dropped.  The  14  which  remain  will 
form  the  survey  index  measures  to  be  used  in  the  present  study: 
Organizational  Climate 

Decision  Making  Practices  — the  manner  in  which  decisions 
arc  made  in  the  system:  whether  they  are  made  effectively, 
made  at  the  right  level,  and  based  upon  all  of  the 
available  Information  (4  Item  Index). 

Communi cation  Flow  --  the  extent  to  which  Information  flows 
freely  in  all  directions  (upward,  downward,  and  laterally) 
through  the  organization  (3  Itt'in  Index). 


i vi 
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Motivational  Conditions  --  the  extent  to  which  conditions 
(people,  policies,  and  procedures)  in  the  organ  1 z.i  ; ion 
encourage  or  discourage  effective  work  (3  item  index). 

Human  Resources  Primacy  — the  extent  to  which  the  climate, 
as  reflected  In  the  organization's  t radices,  is  one  which 
asserts  that  people  are  among  the  organization's  most 
important  assets  (3  item  index). 

Lower  Level  Influence  — the  extent  to  which  non- supervisory 
personnel  anti  first-line  supervisors  influence  the  course  of 
events  in  their  work  areas  (2  item  index). 


Supervisory  Leadership 

Supervisory  Support,  — the  behavior  of  a supervisor  toward 
6 subordinate  .which  serves  to  Increase  the  subordinate's 
feeling  of  personal  worth  (J  item  index) 

Supervisory  Team  Building  - behavior  which  encourages 
subordinates  to  develop  mutually  satisfying  Interpersonal 
relationships  (2  item  index). 

Supervisory  Goal  Lmphasis  — behavior  which  generates 
enthusiasm  (not  pressure)  for  achieving  excellent 
performance  levels  (2  Item  index) 

Supervisory  Wcrk  Facilitation  --  behavior  on  the  part  of 
supervisors  which  removes  obstacles  which  hinder  successful 
task  completion,  or  positively,  which  provides  the  means 
necessary  for  successful  performance  (3  item  index) 


Pfcgr  Leadership 

Peer  Support  — behavior  of  subordinates,  directed  toward 
one  annfUer,  which  enhances  each  member's  feeling  of 
personal  worth  (3  item  index). 

Peer  Team  Building  — behavior  of  subordinates  toward  one 
another  which  encourages  the  development  of  close,  coopera- 
tive working  elationships  (3  item  index). 

Peer  Goal  Emphasis  ~ behavior  on  the  part  of  subordinates 
which  stimulates  enthusiasm  for  doing  a good  job  (2  item 
index) . 

Peer  Work  Facilitation  — behavior  which  removes  roadblocks 
to  doing  a good  job  (3  item  index). 


Satis  ruction 


a measure  of  general  satisfaction  made  up  of 
nq  satisfaction  with  pay,  with  the  supervisor, 
with  co-workers  (peers),  with  the  organization,  with 
advancement  opportunities,  and  with  the  .job  itself 
(7  Item  index). 
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The  typology  of  work  gi  oups  to  be  used  in  this  study  Is  reported  in 
Bowers  and  Hausser  (1977),  contains  17  types,  and  is  based  on  the 
indexes  of  the  Survey  of  Organizations.  The  resulting  types  have  different 
profiles  across  the  indexes,  with  the  patterns  of  these  profiles  being 
quite  distinct..  In  the  present  project,  for  purposes  of  evaluating  the 
different  classification  procedures,  all  work  groups  will  first  be  placed 
into  one  of  the  17  types  by  each  of  four  methods:  distance  function, 
decision  tree,  multiple  discriminant  function,  and  Bayes.  A description 
of  each  of  these  techniques  and  their  Implementation  in  this  stuqy  1c 
given  below. 

Dis tance  Function  Method 

The  HGROUh  program  u;ed  fri  the  original  generation  of  the  typology, 
described  earlier,  uses  a generalized  distance  function  based  on  the 
error  sum  of  squares  --  the  squared  deviations  from  group  means.  In 
the  case  of  the  classification  system  applied  to  new  or  "incoming"  groups, 
this  present  method  calculates  a similar  distance  statistic  between  the 
profile  of  indexes  for  the  group  to  be  classified  and  the  profiles 
(mean  index  values)  for  each  of  the  (7  standard  types.  The  group  is 
then  assigned  to  the  type  for  which  the  distance  value  Is  smallest. 

All  6,100  (approximately)  work  groups  will  be  so  classified  by  the 

1 

distance  function  method.  This  classification  will  serve  as 
the  "expert"  or  correct  classification  in  this  study.  From  tills 
assignment  it  will  be  possible  to  calculate  the  vector  of  means  and  the 
variance- covariance  matrices  for  each  type,  as  well  as  estimate  the 
distribution  of  werk  group  types. 
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Oxlflon  Tree  Method 

A decision  tree  Is  effectively,  a sequenced  series  of  questions, 
e*ch  a limited  set  of  alternatives,  which,  when  one  Is  selected, 
leads  to  the  next  branch  of  the  tree  and  to  the  next  question.  A simple 
example  is  given  In  figure  3.  This  example  contains  three  questions 
(A,  $,  & C) , the  first  two  with  two  alternatives  each,  the  third  with 
&rae.  The  result  Is  twelve  end  states  (not  all  of  which  need  be  distinct; 
only  the  routes  getting  there  need  be  distinct.)  In  applications  to 
classification,  the  end  states  represent  the  categories  of  the  typology- 
The  development  of  a decision  tree  is  not  a data  based  procedure, 
but  rather,  a theoretical  one.  For  the  purposes  of  diagnosing  work  groups 
Into  the  existing  seventeen-class  typology,  a decision  tree  whose  questions 
will  relate  to  relative  performance  on  the  14  Indexes  of  the  SOO  Is 
presently  being  developed.  The  actual  format  of  this  decision  tree 
will  be  presented  in  a separate  technical  report. 

The  decision  tree  algorithm  will  be  used  to  classify  all  (approximately) 
6,100  work  groups  in  the  data  set. 

t 

Multiple  Discriminant  Fi  net  Ion  Method 

The  use  of  multiple  discriminant  functions  to  assign  units  Into 
classes  within  a typology  Is  a standard  procedure.  Essentially,  the 
process  is  to  find  linear  (or  quadratic)  functions  of  thu  predictor 
variables  which  maximally  differentiate  among  the  groups  of  the  typology. 

With  more  classes  in  our  typology  than  predictor  variables,  the  maximum 
nuatoer  of  discriminant  functions  Is  equal  to  the  number  of  variables. 


c ay  r.  While  typically  almost  all  of  the  explainable  variance  Is  accounted 
F*r  by  the  first  two  discriminant  functions,  It  Is  possible  to  uti  lize 
«11  r function*.  The  primary  value  In  the  use  of  discriminant  functions 
4ft  Viet  they  ere  orthogonal . 

Notetlomally,  the  1th  discriminant  function  is  given  by: 

Y1  " /m]  *1JXj 

where  a^,  . *ir  are  the  weights  and  Xi , X2,  ....  X^  ere  the 
predictor  variables.  Thus  the  r discriminant  functions  are  given  by 
Yj,  Yj,  ....  Yr.  The  rocedure  for  finding  the  weights  a^ 4 results  In 
i explaining  He  most  variance,  Y E the  second  largest,  etc.  In  particular, 
the  weights  a^j  are  chosen  to  maximize  v 

mlMSSi  ^iMPJ}JDgJ!lL-£IL  Xi  . 
variance  within  groups  on  Y^ 

The  weights  a^  am  calculated  from  a developmental  sample.  Once  they 
am  calculated,  the  next  step  Is  to  calculate  the  vector  of  Y values  for 
each  work  group,  say  y,  for  each  class  within  the  typology,  the  means 
of  the  y- vectors,  say  g.,  ahd  the  variance-covariance  matrices,  are 
calculated.  Finally,  with  the  above  information,  one  Is  ready  to  begin 
i.h«  class l fi cation  process. 

Under  certain  distributional  assumptions.  Including  equal  variance - 
covert eftcu  matrices  across  types,  the  assignment  of  a new  work  group 
requires  the  calculation  of  the  vector  of  discriminant  function*,  y.„  and 
the  Mahalanobls  distance  between  y.  and  the  center  of  each  of  the  types. 
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The  actual  assignment.  Is  to  the  type  to  which  y.  Is  the  closest.  If  one 
Is  not  able  to  assume  the  common  Variance- covariance  matrices,  the 
assignment  procedure,  while  still  possible.  Is  more  Involved.  It 
effectively  Is  a maximum  likelihood  procedure  which  then  classifies 
a work  group  Into  the  type  which  has  the  largest  probability. 

In  this  study*  the  actual  discriminant  functions  will  be  calculated 
from  a developmental  sample  of  (1*3,000  (approximately),  which  represents 
half  of  the  total  data  set.  (The  developmental  sample  will  be  selected 
randomly  from  the  entire  set.)  The  use  of  a subset  as  a developmental 
sample  will  permit  the  assignment  of  both  the  developmental  sample  and 
the  remainder  of  the  data  set  as  a check  on  the  general Izabi  11  ty  of  the 
procedure  across  similar  data  sets.  The  actual  computer  operations  to  be 
performed  are  those  In  the  DISCRIMINANT  routine  within  MIDAS,  the 
statistical  package  available  at  the  University  of  Michigan  (Fox  & Gulre, 
1976.)  Included  In  the  output  Is  a test  of  the  homogeneity  of  variance- 

covariance  matrices  and  the  distances  between  the  means  of  the  types. 

Bgyeslan  Method 

As  earlier  sections  of  this  report  have  Indicated,  the  role  of  the 
organizational  development  specialist  can  be  linked  to  that  of  the 
physician.  In  both  Instances,  the  specialist  Is  conrontert  with  an 
entity  (a  person  or  an  organization)  which  exists  in  some  "disease"  state. 
The  specialist  must  diagnose  the  disease  and  prescribe  treatment.. 
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Lad ley  and  Lusted  (1959)  described  the  use  of  probabilities  In  medical 
diagnosis.  The  approach  that  they  used,  which  wakes  use  of  Bayes's 
Theorem,,  has  been  called  a “Bayesian"  approach  to  diagnosis,  because 
It  utes  Bayes's  Theorem  as  the  model  for  developing  final  probabilities 
for  each  possible  disease  state.  It  1$  also  called  a relative  frequency 
approach  because  it  uses  relative  frequencies  derived  from  historical 
data  to  develop  the  probabilities  that  enter  Bias's  Theorem. 

Historically,  probabilities  have  been  defined  In  terms  of  relative 
frequencies.  For  Instance,  the  probability  of  obtaining  tails  In  a fair 

i 

coin  toes  1$  .5.  This  probability  comas  from  the  fact  that  If  one  were 
to  toes  a coin  or  coins  a Urge  nunteer  of  times,  the  relative  frequency  of 
tails  would  be  about  .5.  Intuitively,  this  definition  of  probability 
makes  sense,  at  least  when  the  events  In  question  are  truly  repeatable. 

There  Is  another  school  of  thought,  however,  that  finds  the  relative 
frequency  notion,  unduly  restricting.  Consider  a unique  event,  such  as 
the  election  of  the  President  of  the  United  States  In  I960.  Such  an  event 
occurs  only  onca  In  history,  and  from  a relative  frequency  point  of  view. 

It  would  be  meaningless  to  talk  about  probabilities,  e.g.,  the  probability 

that  a particular  Individual  will  win  that  election.  The  probability  of  the 
event  Is  either  tero  or  one,  depending  on  whether  It  occurs  or  does  not  occur. 

On  the  other  hand,  it.  Is  perfectly  clear  that  odds  makers  and  others 
Involved  in  betting  are  not  restricted  to  relative  frequency  notions. 

Whoire  relative  frequency  data  are  available,  a perfectly  rational  oddsmaker 
would  give  bettors  a set  of  odds  that  exactly  match  the  relative 
frequencies  (Ignoring  questions  of  commissions,  "house  cut,"  etc.). 
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Whore  relative  frequency  data  are  not  available,  however , the  oddsmeker 
relies  on  a subjective  probability.  That  subjective  probability  1 . a number 
between  zero  and  onto  which  represents  thu  extent  to  which  the  person 
believes  that  a certain  statement,  Is  true. 

The  Bayesian  school  of  statistics  defines  probability  In  the  above 
sense.  To  a B ay  as  1 art  statistician,  all  probabilities  are  subjective 
probabilities.  They  represent  degrers  of  belief  rather  than  relative 
frequencies.  Subjective  probabilities  should  conform  to  all  rules,  of 
probability  theory  just  as  do  probabilities  defined  Irt  some  other  way. 

The  Bayesian  approach  allows  the  statistician  to  consider  a broader 
amount  and  type  of  Information,  Incorporating  It  into  Ms  calculations, 
than  dots  a classical  approach. 

The  Bayaslan  approach  p n josed  in  this  section  is  similar  to  that 
which  has  been  used  In  medical  diagnosis,  since  It  uses  Bayes's  Theorem 
and  also  relics  on  historical  data.  It  goes  beyond  historical  data, 
however,  by  using  subjective  probabilities  rather  than  simple  relative 
frequencies.  The  reasons  for  the  approach  will  become  more  apparent 
shortly.  In  the  meantime,  however,  a brief  introduction  to  Bayes's 
Theorem  Is  In  order. 


The  diagnosis  sit'  is  characterized  by  uncertainty.  The  diagnos- 
tician Is  faced  with  a let  of  mutually  exclusive  and  exhaustive  states 
(diseases)  Into  one  of  which  the  organization  must  belong.  The  diagnos- 
tician's job  Is  to  Identify  correctly  the  category  that  the  organization 


belong*  in.  The  organization  bring*  to  the  situetion  certain  character- 
istics, akin  to  symptoms  In  the  medical  context,  that  provide  the 
diagnostician  with  information  about  which  disease  states  are  likely  to 
occur. 

Bayes'*  Theorem  is  a relatively  simple  equation  that  defines  the 
probabi] istic  relationship  between  symptom  combinations  and  the  resulting 
disease  steles.  For  the  nomant,  consider  probabilities  as  relative 
frequencies.  Bayes's  Theorem  Is  basically  concerned  with  three  probability 
treasures. 

frior  onabafrl  llty  Is  the  probed>1 1 1 ty  of  a disease  state  lr  the 
general  population.  To  state  It  another  way.  assume  that  a diagnostician 
must  diagnose  a certain  organisation  without  receiving  a ay  symptom 
la  formation  about  the  organization.  His  entire  set  of  knowledge  about 
the  situation  consists  of  a table  of  relative  frequencies  of  each 
disease  state  In  the  general  population.  The  dlagnostl den's  prior 
probability  for  any  disease  cteba  would  be  the  probability  of  the  event's 
occurrence  given  that  the  diagnostician  knows  nothing  about  the  organization 
symptom,  if.  for  Instance,  historical  data  show  that  five  percent  of 
all  organizations  have  disease  A.  then  the  prior  probability  of  disease 
A Is  ,06.  Prior  probability  of  disease  state  I win  be  designated 
P(Df). 

The  1 1kel Ihood  rtf  a symptom  or  symptom  complex  is  the  probabi  llty 
of  the  symptom  or  complex  when  the  organization  is  known  to  haw*  a certain 
disease.  Suppose,  for  Instance,  that  of  all  organization*  that  hare 
disease  A,  it  is  known  that  BOX  of  then  hare  symptom  X.  Than  the  like- 
lihood of  symptom  X given  di  tease  A Is  .6.  The  likelihood  of  symptom  1 
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given  diseas*  1 Is  written  P(Sj  / Dj)-  The  likelihood  of  a symptom 
complex  of  n different  symptoms  Is  written  P(Si,$*,  . ...  / D^). 

Posterior  probability  Is  tha  probability  of  a disease  state  given  the 
occurrence  of  a given  symptom  or  symptom  coaplex.  Posterior  probability 
' Is  designated  P( D,  / Si,$*,  ....  S^). 

Assume  that  there  are  m possible  decease  states,  0^.  There  are  also 
rt  possible  different  symptoms  Sj.  Then  Bayes's  Theorem  states  that  the 
prior  probability*  likelihood,  and  posterior  probability  for  any  disease 
state  and  symptom  complex  are  related  as  follows: 


) P(Sl,Sa»  Sn  ; 

SP(S,. S2 % f 0.)P(D,  1 

1«1  n 1 1 

Historical  data  hove  tradl tl onal ly  provided  the  relative  frequencies 
that  were  used  as  prior  probabilities  and  likelihoods.  Researchers  would 
compile  a data  bank  of  diagnosed  cases,  and  from  this  set  of  data  would 
develop  tables  showing  the  relative  frequency  of  occurrence  of  each 
disease  state  and  of  each  symptom  complex  for  any  given  disease.  With 
f this  information,  the  diagnostician  could  apply  Bayes's  Theorem  when 

conirorted  with  a certain  symptom  complex,  arriving  at  a posterior 
probability  for  each  possible  disease. 


■P(D,  / S^Sa Sn 
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Prob t#mi  Uith  Historical  Data  co\d  Ralaf+ve  Fm(]wntii#n 
Snverel  prtib leans  mult  from  the  exclusive  use  of 
historical  data  and  relative  frequencies  In  developing  the  prior  proba- 
bilities and  likelihoods.  One  of  these  *s  that  there  may  be  .0  objective 
criterion  to  determine  whether  or  not  the  diagnoses  were  correct. 

The  result  Is  a "Catch-22''  situation.  If  objective,  reliable  diagnosis 
criteria  already  exist,  then  there  Is  no  need  to  develop  a new  technique 
for  diagnosis.  But  If  such  criteria  do  not  exist,  then  there  Is  >10  way 
to  judge  whether  or  not  the  new  technique  is  valid.  Ultimately,  the  value 
of  diagnosis  Is  In  Its  resultant  treatment  prescription.  The  objective 
of  diagnosis  should  be  to  classify  cases  (organisations  or  patients)  Into 
different  treatment.  <ategori«is,  i.e.,  Into  classes  in  which  each  member 
of  the  class  reacts  to  any  treatmnnt  the  sane  as  all  other  members  of 
the  class.  Since  the  historical  data  mqv  co  tain  Incorrectly  diagnoses 
cases,  the  relative  frequencies  based  on  those  diagnoses  will  obviously 
be  Incorrect.  This  makes  both  the  .prior  probabilities  and  the  likelihoods 


suspect, 

A second  problem,  perhaps  even  more  severe.  Is  that  most  data  banks 
are  too  limited  to  provide  an  adequate  set  of  relative  frequencies. 
Unless  the  set*  of  diseases  and  symptoms  are  severely  limited,  a very 
large  set  of  cases  is  needed  to  develop  »u.vrpU  relative  frequencies. 

For  instance , consider  the  limited  situation  lit  which  there  are  only 
four  different  symptom  under  consideration  and  each  symptom  can  take  on 
only  twit  values.  There  «re  2",  or  16  different  possible  -ywptom 
contoi nations,  how  assume  that  there  are  three  possible  disease  states. 
This  means  that  there  are  lt>  x 3,  or  48  different  likelihoods  to  compute, 
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one  for  each  symptom  complex  for  each  disease.  Ah  adequate  data  base 
would  require  several  hundred  cases  In  order  to  be  assured  that  a reasonable 
number  of  the  cells  In  the  relative  frequency  table  for  each  disease 
were  non-atero. 

Cells  whose  relative  frequency  Is  zero  present  a special  problem. 
Particularly  when  the  data  base  Is  small,  there  will  be  a significant  number 
of  empty  cells.  Even  the  most  pure  relative  frequeritlst  statistician 
will  be  quick  to  admit  that  not  every  empty  cell  should  lead  the  researcher 
to  assume  that  the  probability  of  that  symptom  conplex  Is  zero.  The 
researcher  has  no  sure  way  of  knowing  whether  the  empty  celln  should  in 
fact  be  cells  with,  a zero  probability  of  containing  cases,  or  whether  they 
are  cells  with  a low  but  non- zero  probability,  or  whether  sampling  error 
has  caused  them  to  be  empty. 

This  zero- frequency  problem  will  be  particularly  acute  in  the  current 
study.  The  symptoms  for  the  study  will  be  scores  on  14  different  Indexes 
on  the  Survey  of  Organizations.  Each  Index  can  take  on  virtually  an 
Infinite  number  of  values  between  one  and  five.  However,  In  order  to 
develop  relative  frequency  tables,  the  data  will  be  grouped  Into  Intervals. 
There  arc  at  least  tf  ree  ways  of  defining  such  Intervals. 

First,  one  could  simply  taxe  the  possible  range  of  scores  and  divide 
this  range  Into  Intervals  of  equal  length.  Thin  would  provide  equal 
raw-score  Intervals. 

A second  method  would  be  to  obtain  standard  scores  for  each  case 
an  each  variable,  and  divide  the  scores  by  equal  standard-score  units. 

The  third  way  would  be  to  define  percentiles,  and  dlvldt  the  scores 
Inti  equal  percentile  Intervals. 

I 
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Regardless  of  which  Interval  classification  method  is  chosen,  the 
scores  will  be  grouped  into  a fairly  large  nunber  of  intervals.  For 
Instance,  each  variable  could  be  grouped  into  Intervals  that  define  the 
five-percent  points  of  a percentile  scale,  thereby  reducing  the  set  of 
possible  values  to  20.  This  set  of  14  Indexes,  each  with  20  possible 
values,  yields  a set  of  20l\  or  6.55  x 10JO  possible  symptom  contl nations . 
Any  data  bank  Is  going  to  be  too  small  to  provide  for  more  than  a very 
small  percentage  of  non-zero  cells  in  such  a symptom  matrix. 

There  is  one  condition  under  which  the  number  of  cells  becomes  more 
manageable.  When  the  symptoms  are  all  conditionally  independent,  the 
researcher  need  riot  fill  in  all  20 possible  cells.  One  needs  to  know 
only  the  marginal  values,  i.e.,  the  relative  frequency  of  each  value  for 
each  single  symptom.  In  this  case  there  would  be  lb  x 20,  or  320  cells. 
Furthermore , such  a 320-cell  matrix  would  need  to  be  completed  for  each 
of  the  disease  states  under  consideration.  In  the  current  study  using 
the  results  of  tire  H-' GROUP  classification  program  there  are  17  different 
profiles  or  disease  states.  In  order  to  have  even  the  potential  of  having 
every  cell  be  non-zero,  one  would  need  20  cases  for  each  disease  state, 
or  340  cases  in  the  data  bank.  The  actual  data  bank  Is  considerably 
larger  than  340  cases.  This  large  sample  size  will  provide  fairly  accurate 
descriptive  statistics  on  each  Index  (variable)  for  emfr  ril  «»*<;*  stats, 
but  will  still  contain  a large  number  of  zero- frequency  cells. 

It  Is  known  that  the  data  are  not  conditionally  independent.  This 
can  be  both  an  advantage  and  a disadvantage.  The  disadvantage  lies  In  the 
fact  that  unless  one  knows  precisely  the  exact  relationship  among  the 
symptoms.  Incorporating  the  knowledge  of  the  non-Independence  Into 
Bayes's  Theorem  Is  Impossible.  One  Is  left  with  the  problem  of  estimating 
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2016  cell  frequencies.  The  advantage  of  knowing  that  the  data  are  not 
conditionally  Independent.  Is  that  ir  may  enable  the  observer  to  reduce 
the  number  of  symptoms  that  must  tie  considered  in  making  tire  diagnosis. 
For  Instance,  If  knowing  X provides  a great  deal  of  confidence  about  the 
value  of  Y,  one  need  not  observe  Y,  since  it  will  add  little  new 
information  to  the  situation. 


j uaaeotvd  Sol 


the  Problem 


The  problem  of  massive  relative  frequency  tables  is  faced  in  virtually 
every  non-trivinl  diagnosis  situation.  Such  large  tables  make  it  almost 
Impossible  to  create  an  adequate  data  base  for  generating  cell  frequencies, 
and  are  also  difficult  to  apply,  since  there  is  such  an  overwhelming  number 
of  cell  values  to  use  in  applying  BuyebS  T -orem.  Gustafson  et  al. 

(1969)  suggested  that  the  proper  way  to  .andle  this  problem  Is  through 
man-machine  systems  In  which  man  esMmaV^{  the  likelihoods  based  on  his 
personal  knowledge  and  the  historical  data  base. 

A similar  solution  may  be  In  order  in  the  current  situation.  In  order 
to  use  such  an  approach,  however,  one  must  accept  the  notion  of  subjective 
probabi  lity  that  was  discussed  earlier.  Recognizing  that  the  data  base 
that  is  used  to  generate  the  likelihoods  is  not  going  to  be  sufficiently 
luryu  Lu  provide  accurate  estimates  of  cell  frequencies,  one  can  use  the 
data  base  as  a random  sample  from  which  one  can  make  Inferences  about 
the  theoretical  distribution  of  cell  frequencies. 

In  other  words,  the  cell  relative  frequencies  become  not  the  like- 
lihoods themselves  but  clues  from  which  one  develops  likelihoods.  The 
researcher  Is  seeking  for  a way  of  describing  u degree  of  belief  abou1 
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the  occurrence  of  the  empty  cells,  and  Is  not  restricted  to  a simple 
frequency  count. 

For  the  time  being,  igno.t>  the  problem  of  non-indepandence:  assume 
that  the  symptoms  are  conditionally  independent  from  each  other.  For  this 
situation  that  means  that  one  must  determine  the  likelihoods  for  16  x c) 
x 17  = 5,440  different  symptom- disease  cells.  In  an  earlier  study 
(Bowers  & Hausser,  1975)  using  civilian  and  Navy  work  groups,  each  of  the 
17  different  profiler,  was  represented  by  at  least  two  percent  of  the 
groups  in  the  defining  samples.  In  that  study,  this  meant  that  there  were 
at  least  11  groups  In  each  of  the  17  profile  categories.  Although  11 
groups  is  not  enough  to  assure  that  all,  or  even  most,  of  the  cells  In 
the  relative-frequency  table  are  non-zero,  it  does  provide  a large  enough 
sample  to  obtain  a fairly  good  estimate  of  the  mean  and  standard  deviation 
of  each  index  for  each  profile  type. 

Consider  each  symptom  (index)  as  a random  variable  that  can  take  on 
any  of  a number  of  values  that  are  at  least  ordinal  in  nature  and  are 
probably  interval  measures.  Intuitively,  It  makes  sense  to  assume  that,  for 
any  disease  state,  the  distribution  of  values  of  any  given  index  is  an 
orderly  on<  . The  historical  data  base  provides  a random  sample  of  values 
of  the  1»  aex,  but  is  subject  to  all  the  sampling  errors  inherent  in  any 
random  sample.  From  this  point  of  view,  It  becomes  unnecessary  to  maintain 
the  iirtificially  Imposed  interval  groupings  of  the  data  scores.  The 
groupings  were  originally  done  in  order  to  create  cells  whose  relative 
frequencies  were  to  be  estimated.  The  actual  data  values,  however,  fall 
along  a continuous  line,  and  may  be  considered  as  having  been  generated  by 
an  underlying  continuous  probability  density  function.  Now,  Instead  of 


looking  at  cell  frequencies,  simply  observe  the  mean  and  standard  deviation 
of  the  defining  sairple.  From  these  sample  statistics  one  can  infer  the 
characteristics  of  the  underlying  data  generating  population. 

The  researcher  must  decide  on  a family  of  distribution  functions  which 
will  be  assumed  to  be  the  data  generating  functions  for  the  likelihoods. 

Two  continuous  distributions  that  immediately  come  to  mind  are  the  normal 
distribution  and  the  beta  distribution.  Both  distributions  are  well-knorn, 
easily  described  distributions . Whether  or  not  either  of  them  describes 
adequately  the  underlying  distributions  for  the  Indexes  will  be  determined 
In  the  course  of  the  research,  but  using  either  of  them  should  result  In 
more  accurate  posterior  probability  estimate  than  would  the  use  of 
inadequate  cell  relative  frequencies. 

An  example  might  make  the  picture  clearer.  Table  3 illustrates  how 
one  might  derive  likelihoods  using  the  sample  mean  and  standard  deviation 
as  a starting  point. 

Example 

Assume  that  l>ie  historical  data  base  has  classified  50  work  groups 
Into  Profile  A.  For  Index  X,  the  distribution  of  scores  Is  shown  in 
Table  3. 

Figure  4 Illustrates  the  frequency  distribution  for  the  sample. 

From  the  mean  and  standard  deviation  ui  Uie  sample,  best-  Ci  llimj  normal 
and  beta  distributions  can  be  derived.  (See  Figure  5 and  Figure  6.) 

From  these  distributions  one  then  can  develop  likelihoods  for  the 
20  intervals.  Table  4 displays  the  likelihoods  derived  from  the  three 
methdds.  Notice  that  using  of  either  the  normal  distribution  or  the 
beta  distribution  eliminates  the  zero- frequency  problem,  and  smooths 
out  the  curve. 


52 


Table  3 

SCOPES  OF  50  HYPOTHETICAL  GROUPS 
FORMING  SAMPLE  FOR  EXAMPLE  1 


. 1 ' n .-»i  >1  i I -n  I i .ii  r i m mm  I • i T i ■'  11  l I »■  1 • ...  " »-  T"'""B  11  ->  ■ - 

Group  N unfoe r 

Mean  Score 

Group  Number 

Mean  Score 

1 

1.71 

26 

2.81 

2 

1.95 

27 

2.83 

3 

2.30 

28 

2.85 

4 

2.32 

29 

2.86 

5 

2.36 

30 

2.87 

6 

2.37 

31 

2.91 

7 

2.40 

32 

2.94 

8 

2.44 

33 

2.94 

9 

2.48 

34 

2.99 

10 

2.49 

35 

3.03 

11 

2.51 

36 

3.10 

12 

2.51 

37 

3.12 

13 

2.52 

38 

3.15 

H 

2.53 

39 

3.16 

16 

2.56 

40 

3.18 

16 

2.62 

41 

3.24 

17 

2.63 

42 

3.25 

18 

2.34 

43 

3.26 

15 

2.65 

44 

3.27 

20 

2.65 

45 

3.31 

21 

2.67 

46 

3.40 

22 

2.67 

47 

3.54 

23 

2,69 

48 

3. CO 

24 

2.70 

49 

3.71 

25 

2,77 

50 

3.92 

Mean  of  Groups 

- 2.7876 

Standard  Deviation  of  Groups  = 

.5527 

* 


H 


i 

i 

i 

.1 

k 
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Figure  4 

Hlstoyrams  of  the  Relative  Frequency  of  Scores  In  Example  1 
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Table  4 

UKEimOORS  FOtt  MOORES  IN  20  INTERVALS  BASED  ON 
RELATIVE  FREQUENCY,  NORMAL  DISTRIBUTION  , AMD  BETA  DISTRIBUTION 


Mafcbod  of  Calculation 


Interval 

Frequency 

Normal 

Beta 

1.  1.0-1. 2 

Q 

,0014 

.0001 

2.  1.2-1. 4 

0 

.0040 

.0017 

3U  1.4-1. 6 

0 

.0098 

.0082 

4.  1.6-1. 8 

.02 

,021 

.023 

ft.  1. 8-2.0 

.02 

.040 

.047 

S.  2. 0-2. 2 

0 

.068 

.075 

7.  2. 2-2. 4 

.08 

.097 

.103 

8.  2. 4-2. 6 

.18 

.125 

.124 

9.  2.6-2. 8 

.20 

.141 

.134 

10.  2. 8-3.0 

.18 

.140 

.131 

11.  3. 0-3. 2 

.12 

,125 

.115 

12.  3. 2-3. 4 

.10 

.09? 

.092 

13,  3. 4-3.6 

.04 

.063 

.066 

14.  3,6-3. 8 

.04 

.037 

.042 

IS.  3. 8-4.0 

.02 

.019 

.023 

16.  4.0-4. 2 

0 

.0091 

.010 

17.  4.2-4. 4 

0 

.0035 

.0036 

18.  4.4-4. 6 

0 

.0012 

.0008 

19.  4.6-4. 8 

0 

.0004 

.0001 

20.  4. 8-5.0 

0 

<,0001 

.00003 
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Dealing  With  the  Proble  > of  Non- Independence 

The  problem  of  non -independence  of  symptoms  Is  a difficult  one  to  deal 
with  In  a Bgyeslan  framework,  Gustafson  et  al.  (1969)  suggested  that  In 
the  medical  situation  the  proble. i could  be  alleviated  by  having  the 
diagnosing  physicians  cluster  the  data  into  groups  that  they  perceive  as 
being  relatively  Independent.  Again,  a somewhat  similar  procedure  could  be 
used  In  the  present  study.  It  Is  straightforward  to  obtain  a matrix  of 
pairwise  correlations  among  the  14  Indexes  used  In  the  organizational 
diagnosis.  From  such  a correlation  matrix,  the  Indexes  can  be  grouped 
Into  relatively  independent  clusters.  When  data  are  conditional  ly 
Independent,  the  application  of  Bayes's  Theorem  Is  straightforward. 


Let  P(Sj  / Dj)  be  the  likelihood  for  a given  symptom  cluster,  and 

assume  that  each  symptom  cluster  Is  Independent  of  all  others.  Let 

Q(S^)  be  the  sum  over  all  disease  st.ites  (Indexes)  of  the  P(Sj/D^)P(D^ ). 

In  other  words,  Q(S.)  * E P(S.  / D.  )P(D, ).  Then  the  posterior  probability 

7 1*1  " ' ' 

of  disease  state  1 when  only  symptom  complex  j has  been  observed  is: 


P(SJ  / D1)P(01) 


Now  If  one  observes  not  Just  symptom  complex  j but  all  m independent 
symptom  complexes,  the  posterior  probability  becomes: 

P(S*  / / D^.-.P^  / D1)P(Di) 

P(°1  / Sl,Si( V “ ' 

Q(Sl)Q(S2)...Q(Sm) 


Creating  such  conditionally  Independant  clusters  thus  simplifies 
the  computation  of  posterior  probabilities  once  the  likelihoods  of  the 
clusters  have  been  determined. 

The  diagnostician  must,  however,  still  deal  with  the  issue  of 
n>  n-indepen  dance  within  each  cluster.  One  way  to  deal  with  it  would  be 
to  compute  a single  score  for  the  cluster,  an  average  of  the  scores  for 
the  Indexes  comprising  the  cluster.  This  Is  simply  a data  reduction  method. 

A second  wqy  of  dealing  with  this  issue  would  be  to  look  at  the 
multiple  correlations  of  related  items,  and  reduce  the  likelihoods  based 
on  those  correlations.  For  Instance,  suppose  that  Index  A and  Index  B 
have  a correlation  coefficient  of  .70.  This  means  that  .49  of  B's  variance 
Is  accounted  for  by  variability  in  A.  If  the  two  indexes  were  conditionally 
independent,  one  would  obtain  the  likelihood  of  the  AB  combination  by 
simply  multiplying  the  likelihoods  for  the  two  indexes.  But  since  the  two 
are  correlated,  multiplying  line  two  likelihoods  will  yield  a likelihood 
for  the  cluster  that  Is  too  large.  That  over-estimation  can  be  compen- 
sated r'or  by  mk  cing  the  individual  likelihoods  by  the  amount  of  variability 
that  they  share.  Thus,  if  the  likelihood  ratio  for  two  profiles  was  X 
If  the  symptoms  were  Independent,  ttwtn  that  likelihood  ratio  would  be 
. SIX  if  they  were  correlated  by  .70.  This  is  because  the  common  variance 
* rs,  or  ,48.  Unique  variance  of  each  is  1.00  - .49,  or  .51. 

A. word  about  likelihoods  and  likelihood  ratios  is  in  order.  One  of 

the  more  difficult' values  to  obtain  in  using  Bayes's  Theorem  Is  thi 

■ n 1 

denominator  of  the  aquation,'  l P($j.,$2,  . , . , ' D,)P(D.).  When 

•j»1  nil 

iiiyes's  Theorem  is. written  in  whet.is  cal  led  ..the  odds- like  II  hood  ratio  form, 

' . r- 

itiat  ctonteblMtor'  drops  odt.  , Odds  arc  simply  ratios  ■ i 


I ' rob, ibl 11  ties. 
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The  prior  odds  of  disease  i against  disease  k are  simply  the  ratio  of  the 

P(D,) 

prior  probabilities  of  the  two  diseases,.  Thus,  Pq  ■ V.  Similarly, 

H\) 

likelihood  ratio  for  two  disease  states  Is  simply  the  ratio  of  the 

likelihoods  for  the  two  diseases.  And  posterior  odds  are  the  ratio  of 

( 

posterior  probabilities.  Using  the  odds -likelihood  ratio  form  of  Bayes's 
Theorem  simplifies  things  a great  deal.  Let  be  the  prior  odds  of 
disease  1 against  disease  k.  Let  be  the  posterior  odds,  and  be  the 
likelihood  ratio  for  symptom  complex  j.  Then  Bayes’s  Theorem  becomes : 
fti  - When  one  is  dealing  with  conditionally  independent  symptom 

complexes,  the  equation  becomes:  flj  „ . , , n 

In  the  preceding  paragraph  It  was  suggested  that  the  likelihood  ratio 
of  a symptom  in  a given  symptom  cluster  be  reduced  by  the  amount  of  overlap 
that  it  had  with  others  in  the  cluster.  Applying  this  principle  to  likelihoods 
rather  than  likelihood  ratios  would  be  difficult  because  of  the  complexity 
of  computation,  but  is  stralghtfotward  when  one  uses  the  odds-likellhood 
ratio  form  of  Bayes's  Theorem. 


Sp«(Tifi,a  I'Tcyoaale  for  tha  Dijyeoian  jHagnooio 
The  specific  analyses  under  consideration  suffer  from  each  of  the 
difficulties  that  can  occur  In  trying  to  apply  a Bayesian  appv'oach  to 
organizational  diagnosis.  However,  the  techniques  described  previously 
can  be  used  to  alleviate  the  difficulties.  It  is  proposed  that  four 
different  tests  of  the  Bayesian  technique  be  made,  two  of  them  assuming 
conditional  Independence  and  two  of  them  dealing  with  Issues  of  non- 
Independence.  In  all  cases,  however,  it  Is  proposed  that  the  data  bonk 
be  divided  Into  a defining  sample  and  a validation  sample.  The  defining 
sample  should  consist  of  approximately  half  of  the  work  groups  in  t.he 
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data  bank.  Thus  the  defining  sarapk  N will  be  approximately  3»0G0, 
Classify  the  3,000  work  groups  using  the  criteria  developed  by  the  H-6RCHJP 
technique.  Create  tor  each  profile  type  a sot  of  descriptive  statistics 

and  relative  frequency  tables  for  each  of  the  16  indexes. 

♦ 

Teeta  Aevurmny  Tntfovitrutynm 

For  this  part  of  the  research,  assume  that  the  16  Indexes  are 
conditionally  Independent.  Ba:  ad  on  that  assumption,  the  researcher  is 
cor» earned  only  with  determining  the  probability  density  function  of  each 
Index  for  each  profile,  and  need  not  consider  the  problem  of  determining 
combined  density  functions  for  ail  16  Indexes.  Further,  It  Is 
proposed  that  the  researcher  use  the  odds-llkellhood  ratio  form  of 
Bayes's  Theorem,  which  means  that  the  ratios  of  the  density  functions 
rather  than  the  absolute  values  of  the  density  functions  are  the  values 
of  Interest.  The  taro  different  ways  of  developing  likelihood  ratio 
density  functions  will  Involve  two  different  families  of  continuous 
distributions . 

Hvthpd  1:  Ttya  Family  of  hon^l  Distributions.  For  each  profile, 
determine  th*  mean  and  standard  deviation  of  the  member  work  groups  on 
each  Index.  Assume  that  the  underlying  data  generating  distribution  is 
a normal  distribution  with  mean  and  variance  equal  to  tin  sample  mean 
and  unbiased  estimate  of  the  sample  veriapre.  The  result  will  be  for  each 

i 

Index  a set  of  normal  density  functions,  one  for  each  of  fche  17  profiles. 

1 

In  order  to  use  the  odds-llkellhwd  ratio  form  of  Bayes's  Theorem, 
select  one  profile  of  the  17  as  the  "star  lard,"  against  which  the 
remaining  16  profiles  will  each  be  compared.  Then  to  obtain  posterior 
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probabilities  across  the  17  profiles  for  ar\y  work  group  that  one  wants 
to  diagnose,  slnply  apply  the  odds -like  It  hood  ratio  form  of  the  theorem 
16  times,  and  convert,  the  posterior  odds  to  probabilities.  Under  the 
assumption  of  independence,  the  posterior  odds  for  any  profile  comparison 
will  simply  be  the  product  of  all  the  likelihood  ratios  (one  likelihood 

\ 

ratio  for  each  index)  times  the  prior  odds. 

Classify  each  of  the  defining  sample  work  groups  using  the  prior 
odds  and  likelihood  ratios  developed  using  the  normal  density  function 
assumption.  A further  test  would  be  to  classify  the  remaining  work 
group;,  those  In  the  validation  sample. 

Method  2:  The  Family  of  Beta  Distributes.  The  beta  distribution 
is  a uni  modal , continuous  distribution,  as  is  the  normal  distribution. 

It  has  some  characteristics  that  are  particularly  appealing  In  the  current 
situation.  The  range  of  scores  on  the  Survey  of  Organizations  Is  limited; 
values  may  range  between  one  and  five.  The  normal  distribution  assumes 
an  infinite  range,  so  it  is  known  a priori  that  the  normal  distribution 
can  never  exactly  fit  the  actual  distribution  of  work  group  scores. 

This  discrepancy  becomes  especially  severe  when  the  mean  of  the  distribution 
is  close  to  either  end  of  the  range,  since  the  actuit'  distribution  must 

} 

be  skewed  but  the  normal  distribution  is  symmetric.  ' 

The  beta  distribution,  however.  Is  defined  over  a closed  interval, 
such  M»  that  which  the  SOQ  scores  cover.  It  reflects  the  skewness  that 
1*  required  by  such  a range  restriction.  The  beta  distribution  is  defined 
by  two  Independent  parameters  which  are  uniquely  determined  by  the  mean  arid 
variance  of  the  distribution.  In  straightforward  fashion  one  may  compute 
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the  two  parameter  of  the  beta  distribution,  thus  obtaining  for  each  index 
on  each  profile  the  underlying  density  that  will  determine  the  likelihood 
ratios. 

Once  the  density  functions  have  been  determined,  proceed  as  for  the 
normal  distribution  test,  classifying  each  work  group  In  the  defining 
sample  and  the  validation  sample  according  to  the  posterior  odds  resulting 

i 

from  the  beta- family  likelihood  ration.  " 


I 
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Tea  tig  Av^uming  Nori-lndByendanm 

Two  methods  will  be  used  to  test  the  Bayesian  diagnosis  technique 
which  take  into  consl deration  the  issue  of  conditional  non- Independence. 
These  methods  should  be  applied  after  the  two  methods  assuming  independence 
have  bean  don*.  Select  the  better  family  of  distributions  to  generate  the 
underlying  probability  density  functions. 

For  both  methods,  the  first  step  is  to  obtain  a correlation  matrix  of 
all  16  Indexes.  From  this  matrix,  select  clusters  that  are  relatively 
uncormlated  with  each  other.  This  procedure  should  result  In  the 
reduction  of  16  non-independent  indexes  to  about  five  relatively  independent 
clusters. 

Method  3:  Creation  of  Grouped  Scores.  This  method  deals  with  the 
non -Independence  problem  by  reducing  the  data  to  a set  of  hopefully 
Independent  scores.  After  the  clusters  have  been  determined,  create  a 
single  score  for  each  cluster  that  Is  the  mean  of  the  scorn  of  thi 
Indexes  belonging  to  the  cluster.  Then  treat  the  nwt  scores  as  Independent 
data  and  develop  the  distribution  functions  as  described  in  the  section 
assuming  Independence.  Once  the  distribution  functions  have  bean 
determined,  carry  out  the  classification  tests  as  described  previously. 


} 
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Method  4;  Reduction  of  Likelihood  Ratios  by  Degree  of  Shared  Variability . 
Starting  with  the  relatively  small  number  of  independent  Index  clusters, 
lock  at. the  multiple  correlations  among  them.  This  can  be  done  by  selecting 
as  the  '"dependent"  variable  the  index  which  has  the  highest  correlation  to 
all  the  others.  Then,  treating  the  other  Indexes  In  the  cluster  as 
Independent  variables,  perform  a multiple  regression.  One  of  the  outputs 
of  the  multiple  regression  is  the  partial  correlation  of  each  variable 

from  which  one  can  compute  the  amount  of  variance  uniquely  accounted  for 
by  each  variable.  The  likelihood  ratio  for  a given  cluster  would  be  the 
product  of  each  index  likelihood  ratio,  reduced  to  the  amount  of  unique 
variance  It  contributes  to  the  overall  score. 

Example 

Suppose  that  Cluster  A is  composed  of  Index  X,  Index  Y,  and  Index  Z. 

Also  suppose  that  Index  X is  the  most  highly  correlated  with  each  of  the 
other  two  indexes.  Make  Index  X the  dependent  variable.  Suppose  that  the 
multiple  regression  of  X with  Y and  L results  in  a multiple  11  of  .86  and 
partial  correlations  of  .69  and  .08  for  Y and  Z respectively.  Then 
assume  that  the  marginal  likelihood  ratios  for  X,  Y,  and  Z are  L.^,  Ly , and 
L^.  then  the  combined  likelihood  ratio  for  the  cluster  would  be; 

Lc  - ( . B6 ) J L x x (69)  •’Ly  x (.08)2lz. 

The  multiple  regression  need  not  be  done  for  each  profile,  but  should 
be  done  only  once,  for  the  overall  defining  sample.  Tills  will  then  yield 
reduction  factors  for  each  index  cluster  that  will  be  used  in  all 
likelihood  ratios  involving  that,  cluster. 

This  procedure  will  result  In  downgraded  likelihood  ratios  for  each 
cluster.  These  modified  likelihood  ratios  will  then  be  treated  as 
conditionally  Independent  figures  In  Bayes's  Theorem. 


It  Is  apparent  that  in  any  non-trlvial  situation,  the  use  of  Bayes's 
Theorem  to  determine  the  probabilities  of  the  possible  profiles  given  the 


Index  values  Is  difficult  at  best.  The  two  most  difficult  problems  are  the 
almost  guaranteed  inadequacy  of  the  data  base  and  the  issue  of  conditional 
non- independence.  Modifying  the  technique  so  as  to  in  same  way  alleviate 
these  problems  leads  to  less  confidence  in  the  final  probabilities,  since 
they  are  based  on  subjective  estimates  that  are  only  "best  guesses"  about 
the  veridical  vatejs.  If  the  measure  of  goodness  of  the  technique  is 
its  distance  ironi  the  ideal  profiles  as  developed  by  H-GROUP,  th>>n  the 
"•jyesian  approach  will  assuredly  perform  less  well  than  a cM stance-based 
approach . 

If,  however,  other  criteria  of  goodness  are  used,  the  Bayesian 
approach  tiny  prove  to  be  more  useful  than  some  other  approaches.  One 
positive  aspect  of  the  Bayesian  approach  Is  that  it  does  not,  strictly 
speaking,  reach  a diagnosis  at  all.  The  final  output  is  a set  of 
f 'ooabi llties  over  all  possible  profiles.  Presumably,  the  diagnostician 
would  select  that  profile  that  has  the  highest  probability.  If,  however, 
the  ultimate  use  of  the  diagnosis  is  In  determining  treatment,  the 
diagnostician  may  want  to  do  more  than  make  a diagnosis.  He  may  want  to 
select  the  treatment  that  will  yield  the  highest  expected  return.  In  such 
a case,  he  should  not  ignore  all  the  less  likely  diagnoses,  but  should 
fb*  «vn*»r  1‘aH  valuer  nf  the  various  treatments  bv  the  likelihood 
that  they  are  to  be  applied  to  each  of  the  possible  disease  states. 


««*w.  MhTHWfltwUj  - .tfr,  w 
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Another  criterion  of  goodness  may  be  the  ability  of  a practitlonev  to 
make  a diagnosis  while  In  the  field.  If  he  is  restricted  to  a distance- 
type  model,  that  may  be  difficult  If  not  impossible.  With  the  Bayesian 
model,  determining  the  initial  set  of  likelihood  ratios  Is  the  difficult 
part;  once  the  values  have  been  determined,  however,  it  would  be  fairly 
simple  to  equip  the  practitioner  with  a sirple  Bayesian  equation  to  use 
In  the  field.  It  would  proably  require  the  use  of  a hand  calculator,  but 
not  a computer.  , - 

There  are  various  other  criteria,  such  as  computing  expense,  the 
amount  of  data  that  must  be  evaluated  in  c.der  to  reach  a diagnosis,  and 
the  ease  of  selecting  second  and  third  choices.  Whether  or  not  the  Bayesian 
approach  would  perform  well  under  those  criteria  is  certainly  testable. 


b 
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EXAMINATION  Of  TECHNIQUES 


Accuracy  Based  Criteria 

The  purpose  of  this  section  Is  to  present  five  different  criteria 
to  use  in  evaluating  classification  procedures,  each  of  which  Is  based  on 

* a 

a measure  of  accuracy.  In  later  sections,  non-accuracy  based  criteria 
are  discussed. 

The  four  different;  classification  procedures  to  be  Investigated  ate 
a decision  tree  ( DT)  . multiple  discriminant  function  (MI)I:) , a Bayes  rule  (B) , 
and  a straight  (least  squares)  distance  function  (DF) . The  typology  Into 
which  work  groups  are  to  be  assigned  has  been  discussed  earlier  in  this 
report.  Two  aspects  of  the  typology  and  Its  development  are  relevant  to 
the  investigation  here.  First,  it  has  been  derived  by  a clustering 
algorithm  which , in  effect,  groups  the  observations  (vectors  of  mean 
scores  from  work  groups)  so  as  to  minimize  the  variance  within  clusters. 

The  variance  metric  is  also  a distance  metric,  which  will  result  In  the 
same  classification  by  the  DF  for  a particular  work  group  as  you  would  get 
by  including  that  work  group  in  the  clustering  process  originally.  The 
consequence  is  that  we  will  use  the  distance  function  (DF)  classification 
as  the  correct  classification  For  any  particular  work  group. 

The  typology  developed  by  the  clustering  algorithm  allows  the  creation 
of  o vector  of  scores  from  the  averages  across  all  work  groups  within  the 
cluster  (Bowers  & llausser,  1975).  Thus,  we  can  think  ot  tne  typology 
containing  17  types  as  a set;  of  17  vectors.  That  is,  a type  Is  represented 
by  a single  vector  of  scores. 


Proportion  of  Correct  Clots  si  fixation 

The  classical  criterion  for  evaluating  classification  schemes  Is  the 
proportion  of  agreement  with  some  external  "expert"  opinion.  In  the 
present  situation*  is  discussed  above,  the  expert  opinion  will  be  that 
provided  by  the  distance  function  (OF).  However,  there  is  some  Interest 

i 

in  seeing  how  well  one  data  analytic  technique  is  able  to  reproduce  another 
data  analytic  technique  when  additional  criteria  (beyond  proportion  of 
agreement)  are  considered.  The  reader  is  referred  to  the  following 
sections  for  llscussion  of  other  criteria. 

As  a beginning,  we  propose  to  analyze  the  three  classification 
procedures,  MBF,  B,  and  OT,  by  comparing  the  proportions  of  agreement  wi th 
i>F.  The  straightforward  process  will  be  as  follows: 

for  each  of  MDF,  B,  and  DT,  compute  the  proportion  of  the 
(approximately)  3,100  work  groups  which  are  asslnned  to  the  same 
type  by  both  the  given  procedure  ami  OF. 

The  figures  obtained  from  the  above  analysis  can  be  viewed,  for 
each  procedure,  as  the  percentage  correct  within  each  type  of  the  typoloqy 
as  well  as  aggregated  across  all  types.  Such  a review  may  enhance  the 
application  of  the  technique  in  other  settings. 

\ 

Reproduce  the  Typo'logu 

Assume  a particular  classification  procedure  has  classified,  say,  k 
groups  Into  a single  type  within  the  typology.  It  then  would  be  possible 
to  compute  the  averages  fo-  those  k groups,  resulting  in  a single  vector 
of  Index  scores.  It  is  desirable  to  have  tills  vector  of  scores  for  the 
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assigned  groups  be  close  to  the  vector  of  scores  which  represent  this 
type,  A measure  of  accuracy  of  prediction  would  be  to  compute  the  sum  of 
the  distances  between  each  of  the  17  types  and  their  corresponding 
mean  vectors  from  the  groups  assigned  to  them. 

A criterion  for  evaluating  the  different  classification  schemes,  then, 
would  compare  the  sums  of  distances  between  the  typology  vectors  and  the 
group  means.  Because  of  the  use  of  a standard  distance  measure  to  compare 
the  type  vector  and  the  vector  of  average  scores  from  the  observations 
assigned  to  that  type,  the  distance  function  will  do  best  here,  as  well. 
However,  it  would  be  appropriate  to  compute  the  sum  of  the  distances 


nr  4-0 

Twi  u i 


o use  ao  m standard  against  others  which  could  bo  cow**>*r<?d 


The  procedure  to  implement  this  process,  using  all  (approximately) 
work  groups,  is  as  follows: 

For  each  of  MDF,  B.  DT,  and  OF: 

1.  Compute  a vector  of  average  values  for  each  set  assigned  to 
a type  by  the  procedure. 

2.  Compute  the  distance  between  this  vector  of  averages  and  the 
type  vector. 

3.  Sum  the  distances  across  all  types. 

For  each  of  MDF , B , and  DT : 

4.  Compute  the  ratio  of  the  sum  to  the  sym  for  DF„ 


< 
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Weighted  Dimenaional  Diatanoee 

The  classical  distance  metric  considers  each  of  the  dimensions  equally. 
It  Is  known  that  of  the  14  dimensions  resulting  from  the  S00  they  are 
not  all  of  equal  importance,  Merited*  then,  Is  the  consideration  of  a 
distance  metric  which  weights  the  dimensions  according  to  their  importance. 

One  possible  scheme  for  ordering  the  14  dimensions  would  be  to  use  the 
criterion  of  the  strength  of  the  relationship  between  the  single  dimension 
and  a measure  of  productivity.  We  will  refer  to  such  a rank-ordering  of 
the  dimensions,  and  the  weights  assigned  to  such  an  ordering,  as  the  per- 
formance ordering  and  performance  weights,  respectively.  Another  example 
of  a method  for  ordering  the  dimensions  is  to  do  so  according  to  the 
susceptibility  to  change.  The  different  dimensions  of  organizational 
functioning  as  defined  by  the  SQO  respond  differentially  to  planned  change 
activities,  and  sane  are  more  difficult  to  change  than  others.  The 
weights  resulting  from  ordering  according  to  this  criteria  will  be 
called  change  weights. 

Algebraically,  the  classical  distance  Is  given  by  d WiKyfV7 
where  y and  ?.  are  the  two  points  of  Interest.  The  weighted  distance  is 
given  by  dw  - *\j  ^ (y^ -S^  )2 , where  Wj,  wz , ...  wn  are  the  weights  assigned 
to  the  dimensions. 

I 

The  actual  choice  of  the  weights  can  be  based  on  several  possible 
criteria.  The  underlying  premise  Is  that  the  weights  reflect  the  relative 
Importance  of  the  dimensions.  As  mentioned  above,  it  is  possible  to  rank 
order  the  14  dimensions  of  the  S00  according  to  their  correlation,  pairwise, 
with  some  productivity  criterion.  One  such  criterion  Is  total  variable 
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expense  (TVE),  measured  as  a percentage,  which  reflects  actual  costs 

compared  to  a standard.  Data  are  available  on  the  correlations  between 

work  group  SOD  scores  and  TVE,  for  a large  niwtoer  of  work  groups  in  a 

variety  of  organizations.  These  correlations,  r^  (between  the  1th  Index 

t TVE)  can  be  used  as  weights  according  to  w,  ■ r1  where  c Is  a constant 

' c* 

chosen  so  that  Ew^~l. 

Considerable  research  has  been  done  on  changing  organizations,  with 

specific  measurement  of  that  change  by  the  SCO  (Bowers  & H.ausser,  1977; 

Bowers,  1973;  Franklin,  1976).  As  a consequence  of  the  work  done  In 

these  past  Investigations,  we  are  able  to  obtain  change  scores  on  each  of 

the  14  SDO  indexes  over  a iarye  number  uf  work  groups  without  mud: 

additional  work.  Let  these  change  scores  be  represented  by  gi,  g2,  ....  g^- 

m 

Define  w.-gl/c,  again  where  c Is  chosen  so  that  l w,=*l.  These  then  will 
1 1-1  1 

be  the  change  weights. 

The  procedure  to  compare  the  classification  techniques  will  be  as 
follows: 

1.  Establish  a set  of  performance  weights  and  a set  of  change  weights. 

2.  Confute  vectors  of  average  values  for  each  set  assigned  to  each 

type  by  each  classification  procedure. 

3.  Compute  the  weighted  distance,  d^,  for  each  set  of  weights, 
between  the  average  for  the  type  and  tire  type  vector. 

4.  Sum  the  weighted  distance  across  the  17  types,  fgr  each  pro- 


v 


i 

i 

t 

} 
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UJUUiw,  mid  rv.  Lull,  Of 

It  Is  noted  that  this  procedure  Is  similar  to  that  suggested  under  the 
heading.  Reproduce  the  Typology. 


S 
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Zr  - o-Ona  Count 

In  assigning  a work  group  to  a single  type  withi  1 the  typology, 
two  different  patterns  of  work  group  scores  m$y  lead  to  tha  same  distance 
from  the  type.  Consider  the  following  example,  simplified  to  reflect 


only  four  dimensions, 
type 

work  group  1 
work  group  2 


(2,2 ,2,1) 
(4.4,4, 3) 
(2 ,2 .2 ,5) 


distance  « 4 
distance  « 4 


One  approach  to  overcoming  the  above  situation  Is  to  use  the  weighted 
dimensional  distances  described  earlier.  An  alternative  approach  is  to 
count  the  number  of  dimensions  for  which  the  assigned  group  is  significantly 
different  from  the  type.  If  we  define  significantly  different  as  being 
greater  than  or  equal  to  two  In  the  above  example,  the  counts  would  be 
four  for  work  group  1 and  one  for  the  second  work  group.  Mathematically* 
this  counting  can  be  represented  as  the  sum  of  the  values  , 

fi.  if  n.-Mi* 

C tt\  1 

1 |o.  If  |Yi-Xi|<s. 

Here  s Is  the  value  defining  significantly  different,  and  Y_  and  X.  are  the 

vectors  representing  the  poll  ts  of  Interest.  He  will  let  O Ec^.  Then 

C may  be  commuted  for  each  work  group  assigned  to  a particular  type,  anJ  ; 

either  an  average  computed  for  each  type,  or  suitined  across  all  work  groups 

ami  vypissi,  wiisvl  CT.  Tisc  J.vSCC  Z*  Z -fc  Sul'S  tU  *-*>*"*'»  of 

relative  magnitude  or  theoretical  aryument.  For  the  present  study,  we 

prop  j$e  s be  chosen  as  a measure  of  the  average  variability  of  the  SOO 

Indexes.  Letting  q.1  be  the  varla*  «*  within  types  of  the  1th  Index 

averaged  across  types.  Then  let  ~Th  »'i  where  a may  take  on  the 

se«*A  1 T ql 

\|  i*i  1"i 

■alues  1 , l .ft,  and  2.0. 
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Thus,  CT  emu  Id  serve  as  a criterion  for  evaluating  the  accuracy  of 
classification.  The  procedure  involved  would  be: 

1.  Assign  a value  tc  s,  the  significant  difference. 

2.  For  each  procedure,  compute  C for  each  work  group. 

3.  Compute  CT  for  each  procedure. 

Alternatives  could  Include  examining  different  values  of  s,  or  even 
different  values  for  each  dimension,  say  s^.  This  latter  possibility 
has  the  potential  for  weighting  the  dimensions. 


IK. 


i tu  of  Mieolaaaifiaati.c 


One  drgunenL  dyoiiibc.  cite  u>e  ef  the  classical  frequency  of  correct 

classifications  is  that  It  treats  all  wfsclussliflCvUlOns  as  equal.  That 

i 

is,  there  Is  no  distinction  between  miscl issl tying  a particular  work  group 
Into  any  of  the  k-1  Incorrect  types  in  & typology  of  k types.  It  Is  not 
unusual,  however,  for  the  cost  of  mlsclassificatlon  to  be  widely  different 
across  the  k-1  Incorrect  types,  as  well  as  being  contingent  on  the  correct 
type.  For  example,  Incorrectly  diagnosing  an  individual  with  a severely 
sprained  ankle  as  having  a broken  ankle  or  having  leukemia  has  different 
costs;  additionally,  incorrectly  diagnosing  an  encephalitis  case  as  a 
broken  ankle  or  lekemia  has  yet  dt  Tenant  costs. 

An  alternative  to  the  straight  proportion  of  correct  classification 
criterion  of  accuracy  Is  one  which  allows  for  differential^  costs  of 
i.i'iwliCi-ifl Citlsn.  In  ***  'two  dl  f farwnt.  coatinq  models 

for  mlsclassificatlon  for  inclusion  in  the  present  study* 


.fr. 

I'W" 


*•«***  -mwnuj^i  km  ■fmin'iV nrriiaiWd-fiTu ■hdFfrg?fof3fri{fiW.)'i 
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1.  That  a cost  of  0 be  assigned  to  any  mlsclasst  flcatlon  of  a 
work  group  Into  a category  which  calls  for  (a)  treatment 
If  so  does  the  correct  category,  or  (b)  non-treatment  If 
so  does  the  correct  category;  otherwise  the  cost  Is  1. 

2.  That  a cost  of  0 be  assigned  to  any  mis  classification  of 
work  group  Into  a category  which  calls  for  a treatment  which 

l 

y 

Is  known  to  have  a positive  effect,  on  the  correct  category, 
and  a cost  of  1 If  the  Incorrectly  selected  category  coils 
for  a treatment  which  Is  known  to  have  no  effect,  or  a negative 
effect,  on  the  correct  category. 

These  are  only  two  models  of  a vast  number  of  possibilities  for  costing 
mlsclass  iff  cation.  However,  they  are  reflective  of  the  primary  concerns 
of  tnlsclassifi cation  — the  application  of  Inappropriate  or  harmful 
treatments. 

Bowers  and  Hausser  (1977)  examined  the  effects  of  different  change 
strategies  on  each  of  the  17  groups  of  the  typology.  They  rated  the 
effects  either  as  negative,  neutral,  or  positive.  For  number  1 above. 

It  is  possible  to  define  a work  group  calling  for  treatment.  Otherwise, 
th*  work  group  calls  for  no  treatment.  The  results  of  Bowers  and  Hausser 

(1977)  can  also  be  used  to  establish  the  appropriate  pattern  of  costs  for 

i l 

number  2 above.  ( 

Hotationally , the  above  procedures  can  be  generalized  as  follows: 
let  the  typology  contain  k types.  Let  C^j  represent  the  cost  of  diagnosing 
type  1 Into  category  J.  Then  C^O,  but  we  need  not  have  C^j  equal.  The 
ocit  matrix  Is  then  a k.  x k matrix,  given  by: 
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row  represents  true  state 


column  raprasan  s 

classified  state 

Cii  C u Cu  ....  Ci^ 

Cj  i Cj2  C* a • • • • 


Cki  °k*  Ck*  * * * ’ Sck 

The  trad'tlonal  accuracy  cri  terion  of  frequency  of  correct’ classification 


has  cost  matrix 


0 1 I ....  1 

1 0 1 ....  1 
1 1 0 ....  1 


1 1 1 ....  0 

For  procedure  1 above,  assume  that  the  first  t types  call  for 
treatmant,  and  that  the  last  k-t  types  indicate  .to  treatment.  (We  may 
consider  treatment  as  a major  Intervention  Into  the  life  of  the  work 
group.)  Than  the  cost  matrix  for  procedure  1 Is  given  by 


..  \ 

r'o  0 0 ..O'  3 T7..T 
I 0 0 0 ...  0 ] 1 ...  1 


L 0 0 0 ...  0 1 1 ...  1 

1 1 1 ...  1 0 0 ...  0 

1 1 1 ...  1 0 0 ...  0 


75 


If  we  let  be  the  frequency  (relative)  of  classifying  true  type  i 


as  type  j,  the  accuracy  Index  A is  given  by 


A * 1 ,j,  Cij  fij 


which  one 


wishes  to  minimise. 

For  the  classical  procedure,  A Is  just  tha  total  frequency  of  mlsclassi- 
fl  cation. 

A second  variation  In  the  cost  of  mlsclassifi cation  is  naturally 
available  using  a Bayes  classification  procedure.  When  using  such  a 
procedure,  the  result  is  a set  of  k (posterior)  probabilities  that,  the 

work  group  Is  of  type  1,  i * 1,  2 k.  The  generally  accepted  process 

is  to  classify  the  work  group  Into  the  type  for  which  the  probability 
1;  the  However,  it  is  possible  to  say  that,  with  much  higher 

probability,  tha  he  work  group  belongs  to  one  oF  the  three,  say,  types 
with  th'j  three  largest  probabilities.  (Clearly  the  selection  of  three  Is 
arbi trary . ) 

Notationally , the  above  can  be  developed  sequentially  from  the  earlier 
model.  Let  bn1j  be  the  cost  of  classifying  the  n^1  work  group,  which  Is 
actually  of  type  1 , into  type  j.  (Note  that  we  don't  have  and 

b^,  but  only  bn1j  for  one  pai  ’ of  values  I,  j.)  In  effect, 

Then  we  have 


# work  group: 
E b_ 


n»l 


'nlj 


' ij 


If  one  wants  to  consider  the  two  most  likely  groups  of  classification, 
then,  notationally,  let 

bw 


nl  jk 


<:ij  * c1k.  jfi 


2 

0, 


Ml 

otherwise. 
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and  the  three  most  likely  groups  procedure  has  the  following  notation: 

brtjH " hilbalhi,  jn 

3 k^1 

1^1 

■ 0 otherwise 

Here  bn1jkl  represents  the  cost  for  actually  having  classified  the  nth 
work  group,  which  Is  of  type  1,  as  type  j or  k or  1.  The  cost  is  zero 
If  any  of  j,  k,  or  l are  correct,  Clearly  one  Is  interested  in  minimizing 


# work  groups 
B * Z 

n«i  n1*1kl 

While  the  Bayes  approach  lends  Itself  naturally  to  classifying  a work  group 
as  being  one  of  a set  of  three  types  (rather  than  a single  type),  it  Is 
possible  to  do  this  with  other  classification  procedures  as  well.  For 
example,  with  the  distance  statistic,  which  assigns  a work  group  to  the  type 
to  which  It  is  closest,  it  Is  possible  to  define  a set  of  three  types  by 
choosing  the  three  types  to  which  it  Is  closest. 

In  that  the  approach  by  deflnl tl on  gives  the  right  answer.  It  is 
appropriate  to  ask  why  classify  Into  o set  of  three  types.  However,  as 
discussed  elsewhere  In  this  report,  other  factor's  such  as  reduced  data 
sets,  ease  of  computation,  etc.  lead  to  continued  consideration  of  using 
the  distance  procedure  in  this  way. 

We  then  define  a third  cost  of  ml ucl ass  1 fi cation  procedure  as 
3.  That  a cost  of  0 be  a signed  to  any  classification  of  a work 
group  Into  a aet  of  three  types  where  one  or  tne  tnree  is  uie 
correct  type,  and  otherwise,  the  cost  is  1. 
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Non-Accuracy  Based  Criteria 

Information  Required  to  Make  a Decrioicn 

One  criterion  by  which  one  can  compare  diagnostic  techniques  is  the 
E Information  required  to  reach  a diagnosis.  If  two  techniques  perform 

equally  well  when  the  full  set  of  indexes  is  used  in  the  diagnosis,  then 
It  will  be  to  the  diagnostician's  advantage  to  w « the  one  which  requires 

i 

less  Information  to  reach  a decision.  Collecting  and  processing  information 
Is  costly,  In  terms  of  money,  time,  and  complexity  of  processing.  For 
Instance,  if  one  can  obtain  results  using  three  pieces  of  information 
that  are  as  good  as  the  results  using  five  pieces  of  Information,  it  will 
clearly  be  advantageous  to  use  only  three  pieces  of  Inf  urination. 

The  most  obvious  way  of  reducing  the  amount  of  required  information 
Is  to  find  an  appropriate  way  to  reduce  the  nunfeer  of  indexes  that  ure 
used  In  the  diagnostic  process.  There  are  two  ways  of  reducing  the  number 
of  indexes.  The  first  is  to  discard  or  eliminate  indexes  that  have  been 
shown  to  be  unnecessary  for  the  diagnosis.  The  second  is  to  combine 
several  indexes  into  "super- indexes."  For  Instance,  it  might  be  possible 
to  combine  the  four  peer  leadership  Indexes  into  a sinyle  Index. 

First,  consider  reducing  Information  by  deleting  selected  Indexes, 

In  order  to  test  the  feasibility  of  reducing  the  information  required  to 
reach  a diagnosis  in  that,  way,  the  following  steps  are  proposed,  using  the 
defining  sample  from  the  data  set.: 
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1.  For  the  Multiple  Discriminant  Function,  set  a criterion  cut-off 
point  for  the  weights  of  the  variables.  Eliminate  from  the 
diagnostic  equation  those  Indexes  whose  weights  are  below  the 
cut-ofi  point.  Classify  the  defining  sample  groups  based  on  the 
abbreviated  equation.  Compare  the  H-GROUP  results  and  the  complete 
equation  MOF  results,  using  proportion  of  correct  classification 
as  the  criterion.  If  time  and  resources  allow,  this  could  be 
a step-wise  procedure,  in  which  the  indexes  are  deleted  one 
at  a time,  until  a proportion-of-correct-classifi cation  cut-off 
has  been  passed. 

2.  For  the  Bayesian  method,  the  indexes  to  be  eliminated  would 

• VI 
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be  those  that  discriminate  least  among  profile  types,  l.e.,  those 
whose  likelihoods  do  not  vary  greatly  among  profile  types. 

Each  Index  has  associated  with  it  a distribution  of  likelihoods 
for  every  profile  type.  The  current  study  proposes  to  determine 
those  likelihoods  not  from  the  relative  frequencies  of  cases  in 
the  cells,  but  by  deriving  the  assumed  underlying  distributions 
characterized  by  the  mean  and  standard  deviation  of  the  work 
group  scores  In  each  profile  type.  This  mans,  for  instance, 
that  Index  A har  associated  with  It,  a mean  and  standard  deviation 
of  group  scores  for  each  of  profile  one,  profile  two,  profile 
three,  etc.  One  measure  of  the  degree  of  dissinrl  larily  among 
tne  prdn  ies  is  the  standard  error  of  the  mean,  or  in  other 
words,  the  standard  deviation  of  the  mean  scores  across  profl’es. 


The  rule  for  elimination  of  indexes  would  be  to  select  a criterion 
scoie,  and  to  discard  all  indexor.  whose  standard  error  falls 
below  that  cut-off. 

As  was  suggested  for  MDF,  the  criterion  would  be  the 
proportion  ot  correct  classifications  using  the  degraded 
equation.  The  procedure  could  also  be  applied  step-wise,  if 
resources  should  permit. 

3.  For  the  Decision  Tree  technique,  the  selection  of  easily 

deletable  Indexes  should  be  apparent  as  the  tree  is  constructed. 
Again,  one  would  select  those  indexes  which  least  accurately 
discriminate  among  the  profile  types.  The  degraded  decision 
tree  would  be  tested  using  the  proportion- of- correct  classification 
criterion.  This  will  be  discussed  more  fully  In  the  technical 
report  on  the  decision  tree. 


The  second  method  of  Index  reduction  Is  the  combining  of  indexes 


In  order  to  test  the  feasibility  of'  combining  Indexes,  one  would  need 
to  go  back  to  the  data  set  and  perform  some  clustering  procedures. 

With  new  clusters,  one  would  again  perform  H-GROUP  to  derive  a.  new  set 
of  classification  equations.  Since  this  would  be  a very  large  study  in 
Itself,  we  plan  not  t.o  look  at  Information  reduction  of  this  type  in 
this  project. 


or, 
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Amount  of  Data  Hequirud  to  Deueloy  th&  Diagnoetia  Frooena 

Of  Interest  to  the  researcher  is  not  only  the  amount  of  data  that  must 
be  processed  to  obtain  a diagnosis*  but  also  the  amount  of  data  required 
to  generate  accurate  diagnostic  processes. 

All  of  the  techniques  to  be  tested  require  some  kind  of  historical 
data  base,  from  which  the  diagnostic  process  is  generated.  In  all 
Instances,  larger  data  bases  should  provide  more  accuracy  than  smaller 
ones.  If  one  technique  can  be  generated  quite  accurately  from  a smaller 
data  base  than  another,  that  technique  would  be  preferable,  since  It  would 
be  more  cost  effective. 

In  order  to  test  the  relative  strength  of  each  technique  on  this 
criterion,  we  propose  to  draw  a random  sample  (25%)  from  the  defining 
sample,  developing  the  MDF * Bayes,  and  Decision  Tree  models  from  It. 

We  will  then  compare  the  models  against  the  broader  based  models  to  test 
relative  efficiency. 
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Ease,  of  Caloulation 

The  ideal  diagnosis  technique  is  not  only  accurate,  but  relatively 
simple  to  perform.  In  the  organizational  diagnosis  situation,  the 
diagnostician  may  very  well  be  a change  agent  who  must  make  treatment 
recommendations  while  on  site  or  otherwi  out  of  contact  with  computer 
facilities. 

Three  ease-of-calculatlon  factors  are: 

1.  Can  be  calculated  on-site  versus  ^n  a central 
location. 

2.  Can  be  done  with  a hand  calculator  or  by  hand  versus 
requiring  EDP  facilities. 

3.  Few  things  to  be  calculated  versus  many  things  to  be  calculated. 

While  not  Identical,  these  three  factors  are  obviously  not  completely 
Independent  of  one  another. 

The  comparison  of  diagnostic  techniques  on  these  factor's  will  be  a 
matter  of  subjective  judgment.  It  is  proposed  that  after  the  optimal 
solutions  have  been  determined  for  each  diagnosis  technique,  the  researcher’s 
create  a table  Indicating  their  assessnent  of  each  technique  on  the  ease 
of  calculation  factors.  The  diagnostician  will  then  be  able  to  select 
among  the  techniques  when  ease  of  calculation  is  an  Important  aspect  ot 
a project. 


SUMMARY 


This  report  both  introduces  the  topic  of  research  on  organizational 
diagnosis  and  discusses  the  methodological  issues  involved  in  it. 

It  then  describes  the  analyses  to  be  reported  in  subsequent  technical 
reports  in  the  series. 

Any  objective  review  of  tNe  literature  of  organizational  development 
and  change  reveal  that  organizational  diagnosis  is  less  than  a consummate 
skill  in  profes  ional  practice.  FI  dings  from  the  general  field  of 
assessment  and  lassi fi cation  indicate  that  an  acceptable  procedure 
for  organizational  diagnosis  ought  be  statistical,  rather  than  clinical 
or  judgmental. 

Diagnoses  are  really  probability  statements  comparing  a sequence 
of  observed  symptoms  or  characteristics  with  hypothesized  symptom 
sequences  derived  from  past  research. 

The  Survey  of  Organizations  data  Tank  provides  a ready  resourco  for 
systematically  examining  different  statistical  methods  of  organizational 
diagnosis.  To  its  approximately  5,600  civilian  work  groups,  representing 
a broad  array  of  Industrial  and  governmental  settings  and  levels,  are 
added  520  ml  11  taryi work  groups  drawn  In  past  studies  from  the  Navy  and 
Army. 

Four  methods  of  diagnostic  classification  are  to  be  examined: 
distance  function,  multiple  discriminant  function,  Bayesian,  and  decision 


tree. 


83 


Succeeding  technical  reports  in  the  series  cover  the  following 


analyses : 


1.  Assignment  of  all  6,119  groups  to  one  of  the  17  types  in 

the  Bowers  and  Hausser  typology  by  a distance  function  method. 
This  procedure  will  constitute  the  "expert  diagnosis"  \ 

criterion. 

2.  Calculation  of  discriminant  function  weights  from  a developmental 
random  half-sample  of  3,000  (approximately)  groups. 

3.  Development  of  likelihood  functions  to  use  in  Bayes's  Theorem, 
under  the  following  conditions: 

(a)  Independence  of  index  measures: 

(i)  Assuming  an  underlying  normal  distribution. 

(11)  Assuming  an  underlying  beta  distribution. 

(b)  Non-independence  of  Index  measures. 

4.  Assignment,  by  a decision  tree  procedure. 

5.  Comparison  of  the  techniques  In  terms  of  accuracy-based 
crl  teria: 

(a)  Proportion  of  correct  classification  for  MDF,  Bayes, 

and  DT  methods  compared  to  a distance  function  criterion. 

(b)  „ Reproduction  of  the  typology. 

(c)  Weighted  dimensional  distances. 

(d)  Zero -one  count. 


— -***r*  -*% I*- ,,*.*-**« hi 


6.  Comparison  of  the  techniques  in  terme  of  severity  of 
classification.  defined  as  casts  associated  with  inappropriate 
treatment  recoawmndetl ants. 

7.  Comparison  of  the  tedmiqiMes  in  term  of  information  required 
by  {*>  eliminating  index**,  (b>  collapsing  Mnu«,  and 

(c)  > angling  of  respondents. 

8.  Comparison  of  the  techniques  in  terms  of  ease  of  calculation. 


« 
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